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Introduction 

Estrogen  receptor  (ER)  modulators,  such  as  Tamoxifen,  represent  the  most  widely  employed 
therapy  for  breast  cancer.  However,  the  resistance  of  estrogen  receptor-negative  tumor  cells  to 
this  treatment,  the  possibility  that  Tamoxifen  may  increase  the  chances  of  endometrial  carcinoma 
and  the  development  of  Tamoxifen-resistant  breast  cancer  cells  in  many  patients  result  in  a  need 
to  improve  therapeutic  strategies  (see  Brown  and  Lippman,  2000,  for  an  updated  comprehensive 
clinical  review  of  breast  cancer  prevention  studies). 

Retinoids,  the  natural  vitamin  A  derivatives  and  synthetic  analogs,  are  known  to  inhibit  the 
growth  of  a  wide  variety  of  cancer  cells,  including  breast  cancer  cells  (Fitzgerald  et  al,  1997; 
Giannini  et  al,  1997;  Budd  et  al.,  1998;  Tacher  et  al.,  2000),  constituting  a  novel  and  promising 
approach  in  the  prevention  and  treatment  of  the  disease.  Retinoids,  like  steroid  hormones  and 
certain  vitamins,  enter  target  cells  and  bind  to  cognate  members  of  a  large  family  of  Nuclear 
Receptors  (NRs)(see  Mangelsdorf  and  Evans,  1995;  Weatherman  et  al.,  1999  for  review),  which 
activate  or  repress  transcription  in  a  ligand-dependent  manner  by  binding  to  their  cognate 
response  element  located  in  the  promoter  region  of  the  target  gene.  Two  types  of  retinoid 
receptors  have  been  identified:  the  Retinoid  Acid  Receptor  (RAR)  and  the  Retinoid  X  Receptor 
(RXR),  each  composed  of  three  isotypes  a,  P  and  y.  Retinoids  exert  their  anticancer  activities 
both  in  estrogen  receptor-positive  and  -negative  ceUs,  mainly  through  their  abilities  to  modulate 
the  growth,  differentiation,  and  apoptosis  of  tumor  cells.  In  spite  of  their  therapeutic  potential, 
their  clinical  use  is  so  far  limited,  because  of  toxic  side  effects  (Rizvi  et  al.,  1998). 

Consequently,  the  identification  of  structural  motifs  of  small  molecule  modulators  of  the  retinoid 
receptors  is  critical  for  the  development  of  novel  ligands  with  improved  toxicity  profiles,  which 
could  lead  the  way  towards  a  new  generation  of  drugs  against  breast  cancer.  Our  goal  is  to  use 
state-of-the-art  computer  modeling  technologies  to  identify  the  structural  determinants  of 
retinoids  which  confer  them  their  receptor  selectivity  and  their  agonist  or  antagonist  activity,  and 
rationally  design  molecules  with  various  affinities  for  the  RARs  and  the  RXRs. 

Body 

Task  1:  Identify  the  structural  determinants  which  confer  a  ligand  retinoid  X  receptor  (RXR)  or 
retinoic  acid  receptor  (RARl  selectivity 

Using  the  structure  of  RAR  ligand  binding  domain  (LBD)  bound  to  All-Trans  Retinoic  Acid 
(ATRA)  we  docked  9-cis  retinoic  acid  (9-cis  RA)  in  the  ligand  binding  pocket  of  the  receptor. 
The  ligand  and  receptor  side  chains  were  flexible  in  our  improved  docking  algorithm  (Totrov  and 
Abagyan,  1994;  see  also  Totrov  and  Abagyan,  2001,  for  review). 

Using  the  crystal  structure  of  the  unligated  RXR  and  the  result  from  the  previous  step,  we  were 
able  to  build  a  model  of  the  RXR  LBD  bound  to  9-cis  RA,  and  docked  RXR  and  RAR  ligands  to 
the  two  receptors.  We  predicted  that  the  ligand  binding  pocket  of  RXR  is  shorter  and  bulkier 
than  that  of  RAR,  which  accounts  for  receptor  selectivity.  Our  prediction  was  confirmed  by  the 
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crystal  structure  of  the  RXR/9-cis  RA  complex  (Gampe  et  al.,  2000).  The  binding  pocket  shape 
determines  the  specificity  of  the  interaction,  more  than  the  hydrogen-bond  network,  so  steric 
parameters  are  critical  for  receptor  selectivity. 

We  also  docked  RXR-  and  RAR  selective  ligands  into  the  receptors  ligand  binding  pockets  and 
showed  that  our  models  correlated  with  experimental  observations:  non-binders  had  a  positive 
calculated  Van  der  Waals  energy  of  interaction,  whereas  binders  had  a  negative  Van  der  Waals 
energy  of  interaction  (see  Report  1  for  details).  This  correlation  between  ligand  binding 
modeling  and  experiments  was  an  encouraging  indication  of  the  validity  of  our  methodology. 

An  increasing  amount  of  data  suggest  that  the  RARa  isoform,  which  controls  the  expression  of 
RARP,  is  the  most  relevant  target  for  breast  cancer  therapy  (Liu  et  al.,  1996;  Widschwendter  et 
al.,  1997).  A  detailed  understanding,  at  the  atomic  level,  of  the  determinants  for  isoform 
specificity  is  therefore  important  to  design  ligands  with  appropriate  receptor  selectivity.  We 
docked  a  RARa  specific  ligand,  Am580,  in  the  crystal  structure  of  RAR  y,  and  in  a  model  that 
we  built  of  RARa  and  RARp,  and  showed  that  residue  234,  one  of  the  three  residues  of  the 
ligand  binding  pocket  which  are  not  conserved  among  the  three  RAR  isoforms,  was  responsible 
for  receptor  isoform  specificity  (Figure  1). 


Task  2:  Identify  the  structural  properties  a  ligand  must  satisfy  to  be  an  agonist 

Our  original  strategy  was  to  model  the  interaction  between  RXR  and  the  TATA  binding  protein 
(TBP),  which  is  believed  to  act  as  a  coactivator.  However,  key  data  regarding  the  identification, 
activity  and  structure  of  NRs  coactivators  (Freeman,  1999)  and  the  structural  mechanism  of 
agonist  and  antagonist  activity  revealed  by  a  series  of  crystal  structures  (Moras  and  Gronemeyer, 
1998)  have  been  published  since  and  enabled  us  to  refine  our  approach:  Upon  binding  of  an 
agonist,  the  C-terminal  helix  12  of  the  LBD  undergoes  a  conformational  change,  closes  like  a  lid 
on  the  ligand,  and  forms  a  hydrophobic  pocket  on  the  surface  of  the  receptor.  This  pocket  is  the 
target  for  several  known  and  unknown  coactivator  proteins,  which  bind  the  LBD  through  a 
conserved  LxxLL  motif.  In  the  antagonist-bound  conformation,  the  helix  HI 2  is  pushed  away 
from  the  ligand  binding  pocket,  and  occupies  the  coactivator  binding  site,  preventing  coactivator 
recruitment. 

We  collaborated  with  Dr  Herbert  Samuel’s  lab,  at  NYU  Medical  Center,  to  identify  a  novel 
coactivator,  NRIF3,  mediating  functional  specificity  of  nuclear  hormone  receptors.  This  novel 
protein  coactivates  RXR  but  not  RAR.  Our  modeling  suggests  that  NRIF3  binds  to  liganded 
receptors  through  an  LxxIL  module  contained  within  the  C-terminal  domain  of  the  coactivator. 
This  data  was  the  object  of  an  article  published  in  Molecular  and  Cellular  Biology  and  is 
appended  to  this  report  (Article  1). 

Very  interesting  data  recently  published  on  the  structure-activity  relationship  of  another  nuclear 
receptor,  the  estrogen  receptor,  encouraged  us  to  initiate  an  additional  avenue  of  investigation  on 
RAR,  with  possible  important  applications  in  breast  cancer  therapy.  Paige  et  al.  showed  that 
different  agonists  of  the  estrogen  receptor  (estradiol,  estriol,  and  diethylstilbestrol)  could  induce 
different  conformational  changes  of  the  receptor  LBD,  resulting  in  recruitment  of  different 
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coactivators  (Paige  et  al.,  1999).  Structural  analysis  of  many  nuclear  receptors  suggests  that, 
upon  ligand  binding,  the  core  of  the  receptor  LED  remains  fixed,  while  the  C-terminal  HI 2  helix 
is  relocated  and  interacts  with  the  ligand,  thus  generating  a  binding  site  for  coactivator  proteins 
(Moras  and  Gronemeyer,  1998;  Freeman,  1999).  It  is  therefore  reasonable  to  predict  that  the 
different  estrogen  receptor  agonists  induce  a  different  rearrangement  of  the  H12  helix. 

We  built  low  energy  models  of  the  RAR  LED  where  the  HI 2  helix  is  relocated  onto  different 
docking  sites  at  the  surface  of  the  receptor.  Only  a  limited  number  of  rearrangements  of  H12  are 
possible  (Figure  2).  Such  molecules  could  have  a  different  pharmacological  profile,  compared 
with  existing  RAR  ligands. 

Task  3:  Design  rationally  and  test  novel  drugs  against  breast  cancer 

1-  Use  our  model  to  screen  libraries  of  compounds  and  select  ligands  with  desired  structural 
characteristics: 

Eased  on  the  crystal  structure  of  estrogen  receptor  bound  to  Tamoxifen,  we  built  a  model  by 
homology  of  antagonist-bound  RAR  and  carried  out  a  high-throughput  computer  screening 
where  over  150,000  ligands  were  automatically  docked  into  the  receptor’s  binding  pocket  and 
assigned  a  score  according  to  the  quality  of  the  fit  (see  Schapira  et  al.,  1999;  Abagyan  and 
Totrov,  2001,  for  details  on  the  methodology).  About  thirty  RAR  antagonist  candidates  were 
selected  for  in  vitro  assay.  This  work  was  published  in  2000  in  the  Proceedings  of  the  National 
Academy  of  Sciences  and  is  appended  to  this  report  (Article  2). 

We  have  also  used  a  benchmark  virtual  library  made  of  the  structure  of  known  ligands  for 
different  members  of  the  nuclear  receptor  family  in  order  to  optimize  our  virtual  library 
screening  technology.  We  could  show  that  after  four  independent  screenings,  all  the  RAR 
ligands,  and  only  the  RAR  ligands  were  selected  (Table  1),  which  constitutes  and  important 
validation  of  our  approach.  This  results  were  also  published  (Article  2). 

Additionally,  we  have  used  the  crystal  structure  of  agonist-bound  RAR  to  screen  our  virtual 
library  of  over  150,000  ligands  and  selected  about  30  agonist  candidates  to  be  tested  in  vitro. 
This  work  has  been  published  in  BMC  Structural  Biology,  and  is  appended  to  this  report  (Article 
3). 

Little  is  known  about  the  natural  ligands  to  RXR.  Crystallographic  data  of  RXRa  bound  to 
agonist  9-cis  RA  were  published  recently  (Egea  et  al,  2000;  Gampe  et  al.,  2000).  Also,  fatty 
acids  were  found  in  the  ligand  binding  pocket  of  murine  RXR  (Eourguet  et  al,  2000;  Urquiza  de 
et  al.,  2000).  Aiming  first  at  the  discovery  of  potential  natural  ligands  of  RXRa,  we  docked 
4,000  natural  occurring  compounds  to  the  ligand  binding  pocket  of  RXRa  in  the  agonist 
conformation.  Four  different  calculations  were  performed,  and  the  lowest  score  for  each 
compound  is  shown  in  Figure  3a.  Control  known  binders  to  RXR  like  9-cis  RA  and  Targretin 
and  not  binders  like  hydroxytamoxifen  and  MPA  were  correctly  predicted  by  our  scoring 
protocol  (see  Figure  4).  According  to  Figure  3a,  we  considered  that  compounds  with  a  score  less 
than  -50.  may  be  candidates  for  experimental  testing  (see  Figure  3b).  These  results  are 
confidential  and  they  should  be  protected. 
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2-  Test  these  ligands  in  vitro  for  their  receptor  selectivity,  and  their  ability  to  trigger 
transactivation 

Activity  tests  were  performed  on  the  set  selected  after  virtual  screening  procedure,  as  described 
in  1-.  Two  novel  RAR  antagonists  were  discovered,  thus  validating  our  modeling  strategy 
(Article  2). 

We  also  discovered  novel  agonists  of  RAR  (Article  3).  One  of  them,  agonist  3,  presents  very 
original  structural  features:(i)  penta-methylated  benzene  linked  by  a  ketone  to  a  second  aromatic 
ring,  forms  a  very  large  hydrophobic  head,  (ii)  pyridine  probably  makes  a  strong  interaction  with 
Cys  237  and  replaces  a  carboxylate  or  ester  present  in  all  RAR  ligands  described  so  far  (Figure 
5).  These  novel  structural  features  may  translate  into  improved  toxicity  profiles  and  result  in  the 
development  of  new  ligands  for  breast  cancer  therapy. 

This  last  finding  is  extremely  encouraging,  and  we  initiated  a  collaboration  with  Prof.  Stephen 
Wilson  and  Prof.  Herbert  Samuels  to  discover  analogs  of  agonist  3  with  improved  activity.  Dr 
Wilson  is  Professor  of  chemistry  at  New  York  University  and  the  director  of  the  New  York 
University  Laboratory  for  Combinatorial  Chemistry.  He  is  a  renown  expert  in  solid  phase 
chemistry,  and  the  author  of  Combinatorial  Chemistry:  Synthesis  and  Application,  John  Wiley  & 
Sons,  March  1997;  edited  by  Wilson  &  Czamik,  one  of  the  first  books  on  combinatorial 
chemistry  and  its  organic  chemistry  applications.  Prof.  Samuels  is  an  internationally  recognized 
expert  in  endocrinology,  and  has  been  collaborating  with  us  since  the  beginning  of  our  research 
effort. 

RAR  ligands  described  in  the  literature  belong  to  limited  series  of  related  structures,  and  display 
similar  toxic  side  effects.  The  discovery  of  agonist  3  and  the  model  of  its  interaction  with  RAR 
open  an  avenue  towards  new  RAR  agonist  ligands  with  possibly  more  favorable  specificity  and 
toxicity  profiles. 


Key  Research  Accomplishments 

-We  built  a  model  of  the  active  conformation  of  RXR,  which  was  recently  confirmed  by  a  crystal 
structure  (Gampe  et  al.,  2000),  and  could  derive  the  structural  differences  between  the  RAR  and 
RXR  binding  pockets  responsible  for  ligand  selectivity. 

-We  modeled  the  interaction  between  known  RAR-  and  RXR-  ligands,  observing  a  very  good 
correlation  between  our  models  and  biochemical  data. 

-We  provided  structural  determinants  for  RAR  isoforms  selectivity  (Figure  1). 

-We  modeled  the  interaction  between  RAR  or  RXR  and  NRIF3,  a  novel  nuclear  receptor 
coactivator  (Article  1). 

-We  discovered  novel  RAR  antagonists  (Article  2). 
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-We  optimized  our  virtual  screening  technology  and  demonstrated  its  efficacy  at  discriminating 
between  binders  and  non-binders  (Table  1  and  Article  2). 

-We  applied  our  virtual  ligand  screening  procedure  to  find  candidates  among  natural  occuring 
compounds  which  may  bind  to  RXR. 

-We  discovered  an  RAR  agonist  with  very  original  structural  features,  and  initiated  a 
collaboration  with  chemists  (Prof.  S.  Wilson’s  laboratory)  and  biologists  (Prof.  H.  Samuels’ 
laboratory)  to  design,  synthesize  and  test  structural  analogs  of  this  very  promising  molecule 
(Figure  3  and  Article  3). 


Reportable  Outcomes 

Article  1: 

NRIF3  is  a  novel  co-activator  mediating  functional  specificity  of  nuclear  hormone  receptors 
D.  Li,  V.  Desai-Yajnik,  E.  Lo,  M.  Schapira,  R.A.  Abagyan,  and  H.H.  Samuels 
Molecular  and  Cellular  Biology  1999, 19  (10),  7191-7202. 

Article  2: 

Rational  discovery  of  novel  nuclear  receptor  antagonists 

M.  Schapira,  B.  Raaka,  H.H.  Samuels,  and  R.A.  Abagyan 

Proceedings  of  the  National  Academy  of  Sciences  USA.  2000,  97  (3):  1008-13. 

Article  3: 

In  Silico  discovery  of  novel  retinoic  acid  receptor  agonist  structures 
M.  Schapira,  B.  Raaka,  H.H.  Samuels,  and  R.A.  Abagyan 
BMC  Structural  Biology  2001, 1,  1. 

Abstract  1: 

Structural  and  functional  mechanisms  of  the  Retinoid  receptors  addressed  by  a  novel  computer 
approach 

M.  Schapira  and  R.A.  Abagyan 

Presented  at  the  “Era  of  Hope”  meeting,  Atlanta,  GA,  June  8-11,  2000. 


Conclusions 

During  the  three  years  of  our  effort,  we  have  accomplished  most  of  the  tasks  outlined  in  the 
approved  Statement  of  Work,  despite  a  difficult  move  ot  the  laboratory  from  the  NYU  Medical 
Center  to  the  Scripps  Research  Institute.  We  have  identified  the  structural  determinants  which 
confer  a  ligand  R^  or  RAR  selectivity,  and  successfully  challenged  our  models  with  known 
ligands  for  either  receptor.  We  have  also  put  a  rational  on  RAR  isoforms  specificity  which  will 
be  useful  to  develop  RARa  specific  agonists  -the  biological  activity  which  seems  the  most 
promising  for  breast  cancer  therapy. 
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Based  on  structural  data  on  the  estrogen  receptor,  we  also  produced  models  of  distinct  RAR 
conformations,  which  may  be  used  to  derive  novel  types  of  ligands  for  this  receptor.  Such 
ligands  could  have  interesting  activities,  mirroring  the  complexity  of  small  molecule  estrogen 
receptor  modulators. 

We  have  developed  a  strategy  to  rationally  design  RAR  antagonists,  and  were  also  able  to 
discover  novel  RAR  agonists  with  very  original  structural  features.  Analogs  of  these  novel 
molecules  could  result  in  the  identification  of  leads  against  breast  cancer.  This  task  is  currently 
on  its  way.  Virtual  ligand  screening  was  also  used  to  find  potential  natural  ligands  of  RXRa  with 
promising  results.  Candidates  should  be  synthetized  and  tested  in  vitro  and  in  vivo. 

"So  what": 

We  have  developed  a  series  of  approaches  to  identify  new  antagonists  and  new  agonists  for  the 
retinoid  receptors,  based  on  the  structure  of  the  receptor,  and  regardless  of  existing  compounds. 
This  original  strategy  already  resulted  in  the  discovery  of  active  ligands  presenting  very  original 
features.  Collaboration  with  chemists  and  biologists  to  develop  analogs  of  these  molecules  with 
improved  activity  and  specificity,  which  would  constitute  valuable  leads  for  breast  cancer 
therapy  is  presently  on  its  way. 
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Figures  and  Tables 
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Figure  1:  Docking  of  a  known  RARa  specific  agonist.  The 
RARa  selective  agonist  Am580  was  docked  into  the  modeled 
ligand  binding  pocket  of  RARa.  A:  The  complex(?)  ligand 
(white  sticks)  superimposes  with  the  crystal  structure  of 
bound  all-trans  RA  (green).  Hydrogens  are  not  shown  for 
clarity.  B:  Am580  (CPK  display)  fits  tightly  into  the  receptor’s 
pocket  (yellow  wire),  except  for  a  ketone  oxygen,  which  does 
not  clash  with  the  receptor  only  because  it  can  share  an 
hydrogen  with  Ser234  (displayed  as  stick).  In  RARP  and  y, 
Ser234  are  replaced  by  an  alanine,  which  clashes  with  the 
ketone  oxygen.  The  receptor  in  the  vicinity  of  the  ligand  is 
shown  as  a  white  ribbon.  Carbons,  hydrogens,  oxygens  and 
nitrogen  are  colored  white,  grey,  red  and  blue  respectively. 
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Figure  2:  Models  of  RAR  LED  where  the  C-terminal  helix  H12  (green,  magenta)  is 
relocated  onto  different  docking  sites  of  the  receptor  (surface  representation)  could 
provide  novel  binding  pocket  conformations  for  the  structure  based  discovery  of 
agonists  (cyan)  with  an  improved  pharmacological  profile. 
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Ligand 

Activity  Scorel 

Scorel 

Scores 

Scored 

Selected 

Binding 

AGN193836RARa  agonist  -37.2 

-36.5 

-36.7 

-35.3 

+ 

+ 

ATRA 

RAR  pan-agonist  -51.7 

-52.6 

-51.8 

-52.0 

+ 

+ 

Ro415253 

RARa  antagonist  -28.9 

-24.4 

-39.0 

-46.6 

+ 

+ 

MX781 

RAR  antagonist  -45.3 

-48.0 

-40.2 

-45.6 

+ 

+ 

CD2366 

RAR  pan-  ^ 

antagonist. 

-50.8 

-29.3 

-29.3 

+ 

+ 

Targretin 

RXR  pan-agonist  -25.4 

-23.0 

-22.2 

-31.0 

- 

- 

SRI  1203 

RXR  pan-agonist  -28.2 

-22.7 

-22.1 

-27.5 

- 

- 

Tamoxifen 

ER  modulator  -26.4 

-24.6 

-30.3 

-23.4 

- 

- 

Raloxifene 

ER  modulator  -15.6 

-23.7 

-18.4 

-17.4 

- 

- 

RU486 

Progest  Rec  antag.  -21.4 

-20.6 

-20.3 

-20.1 

- 

- 

9cisRA 

RAR/RXR  28  8 

agonist. 

-39.5 

-33.5 

-38.7 

+ 

+ 

AGN193 109*^  -55. 1 

antagonist. 

-55.5 

-41.2 

-54.8 

+ 

+ 

AGNpartia 

RAR  partial  ^ 

agonist. 

-61.3 

-61.4 

-61.0 

+ 

+ 

Am580 

RARa  agonist  -46.6 

-47.2 

-46.6 

-46.5 

+ 

+ 

EM652 

ER  antagonist  -26.3 

-23.1 

-23.7 

-27.3 

- 

- 

Antagonist!  Novel  RAR  antag  -32.1 

-32.1 

-31.7 

-31.6 

+ 

+ 

Antagonist2  Novel  RAR  antag  -33.3 

-29.7 

-33.8 

-33.8 

+ 

+ 

Table  1:  Control  screening  of  known  ligands.  A  similar  screening  as  the  one  performed  on  the 
large  library  was  carried  out  four  times  on  a  small  database  made  of  known  nuclear  receptor 
ligands:  the  compounds  were  automatically  docked  in  the  structure  of  RAR.  The  molecules 
which  met  at  least  once  with  the  criteria  for  selection  are  listed  as  "Selected".  The  ligands  which 
are  experimentally  binding  to  RAR  are  listed  as  "Binding". 
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(b) 


Figure  3:  Virtual  ligand  screening  scores  for  natural  occuring  ligands  docked  to  RXRa.  Figure 
(a)  shows  the  scores  of  all  molecules  in  the  set,  and  (b)  the  best  scores. 
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MPA 


Hydroxitamoxifen 


Figure  4:  Ligands  docked  to  holo-RXRa.  9cis-RA  and  Targretin  are  known  ligands  to  RXR 
Hydroxitamoxifen  is  a  Estrogen  Receptor  binder  and  MPA  an  Androgen  Receptor  binder 
which  do  not  bind  to  RXR. 


Figure  5:  Unlike  all  retinoid  receptor  ligands  described  in  the  literature,  the  discovered 
Agonist  3  (green)  does  not  have  carboxylate,  but  pyridine  which  makes  thiolate  salt  with 
cystein  (white)  of  the  receptor,  and  could  also  interact  with  neighboring  arginine  (white).  This 
compound  is  active  at  200  nM,  and  the  rational  design  of  analogs,  which  conserve  this  original 
feature  could  result  in  leads  against  breast  cancer  with  improved  specificity  and  toxicity 
profiles.  We  embarked  on  the  development  of  such  molecules  in  collaboration  with  renowned 
chemists  and  biologists. 
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Toward  a  new  chemotherapy  for  breast  cancer:  Structural  and  functional  mechanism  of 
the  retinoid  receptors  addressed  by  a  novel  computer  approach. 
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Structural  and  Functional  Mechanism  of  the  Retinoid  Receptors  Addressed  by  a  Novel 

Computer  Approach 

Matthieu  Shapira  and  Ruben  Abagyan 

Era  of  Hope,  Atlanta,  GA,  June  2000. 

Unlike  antiestrogens,  such  as  tamoxifen  or  raloxifene,  which  can  be  effective  against 
estrogen-dependent  breast  tumors,  compounds  targeted  at  the  Retinoid  Receptors  (RR), 
members  of  the  Nuclear  Receptors  (NR)  family,  can  be  active  against  both  estrogen 
receptor  (ER)  positive  and  negative  himors.  As  a  consequence,  understanding  the 
structural  mechanism  of  NR  in  general,  and  RR  in  particular,  could  help  design  and 
optimize  a  novel  generation  of  chemotherapeutic  agents  against  breast  cancer. 

The  RR  are  divided  into  two  protein  subfamilies,  the  Retinoic  Acid  Receptor  isoforms 
(RAR-alpha,  -beta,  and  -gamma)  and  the  Retinoid  X  Receptor  isoforms  (RXR-alpha,  - 
beta,  and  -gamma).  It  is  today  not  clear  which  receptor  and  which  isoform  is  the  best 
target  for  breast  cancer  therapy.  As  a  result,  it  is  important  to  test  both  agonists  and 
antagonists  targeted  at  various  isoforms  of  RAR  and  RXR.  A  detailed  understanding  of 
(i)  the  structural  mechanism  of  activation/inhibition  of  NR  and  (ii)  the  structural 
determinants  of  RR  speeificity  is  important  to  designing  modulators  with  required 
activity.  Our  goal  is  to  use  a  powerful  computer  modeling  technology  to  address  these 
two  questions. 

We  hypothesized  that  the  structural  mechanism  of  antagonist  activity  revealed  by  the 
tamoxifen  crystal  structure  could  be  extended  to  other  members  of  the  NR  family,  and 
use  this  structure  as  a  template  to  build  a  model  of  RAR  in  an  antagonist-bound 
confirmation.  Known  RAR  antagonists  docked  into  the  modeled  pocket  displayed  a 
perfect  fit.  This  3D  model  of  inactive  RAR  was  then  used  to  discover  novel  RAR 
antagonists:  a  high  throughput  virtual  screening  algorithm  was  used  to  dock  rapidly  a 
database  of  200,000  virtual  structures  into  our  modeled  RAR-antagonist  binding  site.  32 
compounds  were  actually  tested  in  vitro  and  2  novel  antagonists  were  discovered.  Three 
novel  agonists  were  also  discovered  using  the  same  approach. 

The  crystal  structure  of  the  ligand  binding  domain  (LED)  of  RAR  bound  to  its  natural 
agonist  is  known,  but  the  structure  of  active  RXR  is  not.  We  produced  a  model  of  RXR- 
LBD  bound  to  the  agonist  9-cis  Retinoic  Acid.  The  comparison  of  active  RAR  and  RXR 
Revealed  that  the  two  ligand  binding  pockets  are  equally  hydrophobic;  however,  the 
RXR  ligand  binding  pocket  is  shorter  and  bulkier,  and  can  accommodate  for  ligands 
presenting  a  more  distinct  shape  than  RAR  agonists. 

The  U.S.  Army  Medical  Research  and  Material  Command  under  DAMD17-98-1-8133 
supported  this  work. 
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Many  nuclear  receptors  are  capable  of  recognizing  similar  DNA  elements.  The  molecular  event(s)  underlying 
the  functional  specificities  of  these  receptors  (in  regulating  the  expression  of  their  native  target  genes)  is  a  very 
important  issue  that  remains  poorly  understood.  Here  we  report  the  cloning  and  analysis  of  a  novel  nuclear 
receptor  coactivator  (designated  NRIF3)  that  exhibits  a  distinct  receptor  specificity.  Fluorescence  microscopy 
shows  that  NRIF3  localizes  to  the  cell  nucleus.  The  yeast  two-hybrid  and/or  in  vitro  binding  assays  indicated 
that  NRIF3  specifically  interacts  with  the  thyroid  hormone  receptor  (TR)  and  retinoid  X  receptor  (RXR)  in  a 
ligand-dependent  fashion  but  does  not  bind  to  the  retinoic  acid  receptor,  vitamin  D  receptor,  progesterone 
receptor,  glucocorticoid  receptor,  or  estrogen  receptor.  Functional  experiments  showed  that  NRIF3  signifi¬ 
cantly  potentiates  TR-  and  RXR-mediated  transactivation  in  vivo  but  has  little  effect  on  other  examined 
nuclear  receptors.  Domain  and  mutagenesis  analyses  indicated  that  a  novel  C-terminal  domain  in  NR1F3  plays 
an  essential  role  in  its  specific  interaction  with  liganded  TR  and  RXR  while  the  N-termlnal  LXXLL  motif  plays 
a  minor  role  in  allowing  optimum  interaction.  Computer  modeling  and  subsequent  experimental  analysis 
suggested  that  the  C-terminal  domain  of  NRIF3  directly  mediates  interaction  with  liganded  receptors  through 
an  LXXIL  (a  variant  of  the  canonical  LXXLL)  module  while  the  other  part  of  the  NRIF3  protein  may  still  play 
a  role  in  conferring  its  receptor  specificity.  Identification  of  a  coactivator  with  such  a  unique  receptor  specificity 
may  provide  new  insight  into  the  molecular  mechanism(s)  of  receptor-mediated  transcriptional  activation  as 
well  as  the  functional  specificities  of  nuclear  receptors. 


Nuclear  hormone  receptors  are  ligand-regulated  transcrip¬ 
tion  factors  that  play  diverse  roles  in  cell  growth,  differentia¬ 
tion,  development,  and  homeostasis.  The  nuclear  receptor  su¬ 
perfamily  has  been  divided  into  two  subfamilies;  the  steroid 
receptor  family  and  the  thyroid  hormone/retinoid  (nonsteroid) 
receptor  family  (51).  The  steroid  receptor  family  includes  re¬ 
ceptors  for  glucocorticoids  (GR),  mineralcorticoids,  progestins 
(PRs),  androgens  (AR),  and  estrogens  (ERs)  (51).  The  non¬ 
steroid  receptor  family  includes  receptors  for  thyroid  hor¬ 
mones  (TRs),  retinoids  (retinoic  acid  receptors  [RARs]  and 
retinoid  X  receptors  [RXRs]),  l,25-(OH)2-vitamin  D  (VDR), 
and  prostanoids  (peroxisome  proliferator-activated  receptors 
[PPARs])  as  well  as  many  orphan  receptors  whose  ligands  (if 
any)  remain  to  be  defined  (49,  51).  Members  of  the  nuclear 
receptor  superfamily  have  common  structural  and  functional 
motifs.  Nevertheless,  an  important  difference  exists  between 
the  two  subfamilies.  Steroid  receptors  act  primarily  as  ho¬ 
modimers  by  binding  to  their  cognate  palindromic  hormone 
response  elements  (HREs)  (77,  78).  In  contrast,  members  of 
the  nonsteroid  receptor  family  can  bind  to  DNA  as  monomers, 
homodimers,  and  heterodimers  (25,  78).  Their  corresponding 
HREs  are  also  complex  and  can  be  organized  as  direct  repeats, 
inverted  repeats,  and  everted  repeats  (49).  Therefore,  the  com¬ 
bination  of  heterodimerization  and  HRE  complexity  provides 
the  potential  of  generating  enormous  diversity  in  receptor- 
mediated  regulation  of  target  gene  expression. 
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Structural  and  functional  studies  indicate  that  the  ligand 
binding  domain  (LBDs)  of  many  members  of  the  thyroid  hor¬ 
mone/retinoid  receptor  family  harbor  diverse  functions.  In  ad¬ 
dition  to  binding  to  ligands,  the  LBD  also  plays  roles  in  medi¬ 
ating  receptor  dimerization,  hormone-dependent  transactivation, 
and,  with  TR  and  RAR,  ligand-relieved  gene  silencing  (54, 61). 
The  carboxyl-terminal  helix  of  the  LBD  has  been  implicated  in 
playing  an  important  role  in  ligand-dependent  conformational 
changes  and  transactivation  (6, 9, 21, 43).  Although  it  has  been 
suggested  that  an  activation  function  (AF-2)  resides  in  this 
C-terminal  helix,  recent  studies  indicate  that  AF-2  results  from 
a  ligand-induced  conformational  change  involving  diverse  ar¬ 
eas  of  the  LBD  (23,  66).  Thus,  ligand  binding  serves  to  switch 
the  receptor  from  one  functional  state  (e.g.,  inactive  or  silenc¬ 
ing)  to  another  (e.g.,  transactivation). 

Although  much  has  been  learned  from  studying  the  struc¬ 
tures  and  functions  of  these  receptors,  the  detailed  molecular 
mechanism(s)  of  transcriptional  regulation  by  these  receptors 
is  not  well  understood.  Efforts  to  understand  the  molecular 
mechanism  of  transcriptional  repression  by  unliganded  TRs 
and  RARs  have  led  to  the  description  (12)  and  isolation  of  the 
putative  corepressor  proteins  SMRT  and  N-CoR,  which  inter¬ 
act  with  the  LBDs  of  these  receptors  in  the  absence  of  their 
ligands  (15,  36).  The  recent  discovery  that  both  SMRT  and 
N-CoR  form  complexes  with  Sin3  and  a  histone  deacetylase 
suggests  that  chromatin  remodeling  by  histone  deacetylation 
may  play  a  role  in  receptor-mediated  transcriptional  repression 
(33,  55). 

In  a  somewhat  parallel  approach,  the  identification  of  coac¬ 
tivators  has  recently  received  extensive  experimental  attention 
in  order  to  elucidate  the  molecular  mechanism(s)  of  transcrip¬ 
tional  activation  by  nuclear  receptors  (27).  Identified  coactiva- 
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tor  proteins  primarily  belong  to  two  groups:  the  SRC-1  family 
and  the  CREB-binding  protein  (CBP)/p300  family.  The  SRC-1 
family  includes  SRC-l^JCoA-1  (37,  58,  74)  and  the  related 
proteins  GRIPl/TIF2/NCoA-2  (34,  35,  74,  79)  and  AIBl/p/ 
CIP/ACrR/RAC3/TRAM-l  (2,  14,  44,  73,  74).  The  second 
group  of  coactivators  includes  CBP  and  its  homolog  p300, 
which  not  only  influence  the  activities  of  nuclear  receptors  (13, 
31,  37)  but  also  functionally  interact  with  many  transcription 
factors  such  as  CREB  (3, 16, 40, 46),  the  Stats  (10, 87),  API  (4, 
7),  and  p53  (28, 45).  There  are  also  coactivator  proteins  that  do 
not  belong  to  these  two  groups,  such  as  ARA70  (85),  PGC-1 
(60),  and  the  recently  reported  RNA  coactivator  SRA  (41). 
Members  of  both  the  SRC-1  family  and  CBP/p300  family  have 
been  shown  to  possess  histone  acetyltransferase  activities  (8, 
14,  57,  69),  suggesting  that  chromatin  remodeling  by  histone 
acetylation  is  an  important  mechanism  involved  in  transcrip¬ 
tional  activation  by  ligand-bound  nuclear  receptors. 

Interaction  of  members  of  the  SRC-1  and  CBP/p300  families 
with  nuclear  receptors  occurs  through  conserved  LXXLL  mo¬ 
tifs  (32),  which  interact  with  a  hydrophobic  cleft  in  the  receptor 
LBD  formed  as  a  result  of  conformational  changes  mediated 
by  ligand  binding  (19,  23,  56).  SRC-l/NCoA-1  and  GRIPl/ 
'nF2  contain  three  LXXLL  regions  or  boxes  (referred  to  as 
LXDs  or  nuclear  receptor  boxes)  that  differentially  interact 
with  nuclear  receptors  so  that  different  nuclear  receptors  func¬ 
tionally  utilize  difcrent  LXXLL  boxes  (19,  52).  Thus,  ER  uses 
the  second  LXXLL  box  of  SRC-l/NCoA-1  while  PR  uses  both 
the  first  and  second  LXXLL  boxes  for  optimal  interaction.  In 
contrast,  TR  and  RAR  require  both  the  second  and  third 
LXXLL  boxes  for  optimal  interaction  (52).  The  specificities  of 
receptor  recognition  by  the  different  LXJ^L  boxes  of  SRC-1/ 
NCoA-1  are  primarily  mediated  by  8  amino  acid  residues  C 
terminal  to  the  LXXLL  motif  rather  than  by  the  2  amino  acids 
(XX)  within  the  motif  itself.  Thus,  while  members  of  the 
SRC-1  family  are  capable  of  interacting  with  many  nuclear 
receptors,  the  molecular  details  of  such  interactions  differ  for 
each  receptor  in  the  number  or  combination  of  LXXLL  boxes 
used  as  well  as  in  the  critical  amino  acid  residues  surrounding 
the  LXXLL  motifs. 

While  much  has  been  learned  from  the  study  of  known 
coactivators,  a  number  of  key  mechanistic  questions  remain  to 
be  answered.  For  example,  many  nuclear  receptors  can  recog¬ 
nize  common  DNA  elements,  (25,  49,  51),  while  not  all  are 
capable  of  regulating  genes  containing  those  elements  (20,  47, 
65).  Thus,  how  native  target  genes  containing  such  elements 
are  selectively  regulated  by  specific  receptors  is  a  very  impor¬ 
tant  but  poorly  understood  problem.  Although  the  various 
LXXLL  boxes  of  SRC-1  and  GRIPl  show  differential  receptor 
preference  (19,  52),  these  coactivators  are  unlikely  to  play  a 
primary  role  in  mediating  effects  that  are  receptor  specific 
since  they  appear  to  interact  with  all  ligand-bound  nuclear 
hormone  receptors.  Thus,  the  detailed  molecular  mecha- 
nism(s)  underlying  receptor-specific  regulation  of  gene  expres¬ 
sion  remains  to  be  elucidated.  Whether  a  coactivator(s)  con¬ 
tributes  to  this  specificity  is  currently  unknown. 

To  further  our  understanding  of  the  molecular  events  un¬ 
derlying  receptor-activated  transcription,  we  sought  to  identify 
additional  coactivators  using  a  yeast  two-hybrid  screening 
strategy  (29).  In  this  paper,  we  report  the  isolation  of  a  novel 
coactivator  for  nuclear  receptors,  designated  NRIF3  (for  nu¬ 
clear  receptor-interacting  factor  3).  Fluorescence  microscopy 
indicates  that  NRIF3  is  a  nuclear  protein.  The  yeast  two-hybrid 
and  in  vitro  binding  assays  revealed  that  NRIR  interacts  spe¬ 
cifically  with  TR  and  RXR  in  a  ligand-dependent  fashion  but 
does  not  interact  with  other  examined  nuclear  receptors. 
Transfection  experiments  indicated  that  NRIF3  selectively 


potentiates  TR-  and  RXR-mediated  transactivation  in  vivo. 
The  NRIF3  gene  encodes  a  small  protein  of  177  amino  acids 
and,  other  than  having  an  N-terminal  LXXLL  motif,  has  no 
homology  with  known  coactivator  genes.  The  results  of  a  com¬ 
bination  of  computer  modeling  and  domain  and  mutagenesis 
analyses  suggest  that  NRIF3  interacts  with  nuclear  receptors 
through  its  C-terminal  domain  that  contains  a  novel  LXXIL 
module  while  another  part  of  NRIF3  may  contribute  to  its 
observed  receptor  specificity.  These  findings  may  provide  novel 
insight  into  the  molecular  mechanism(s)  of  receptor-mediated 
transcriptional  activation  as  well  as  the  functional  specificities 
of  nuclear  receptors. 

MATERIALS  AND  METHODS 

Isolation  of  NRIFs  and  the  yeast  two-hybrid  assay.  The  Brent  two-hybrid 
system  (29)  was  employed  to  isolate  candidate  cDNA  clones  interacting  with 
I^xA-TRo  in  a  ligand-dependent  fashion.  Full-length  chicken  TRa  (cTRa)  was 
fused  in  frame  to  the  C  terminus  of  the  LexA  DNA  binding  domain  (DBD)  in 
pEG202  (29).  The  LexA-TRa  bait,  the  LacZ  reporter  (pSH18-34),  and  a  pJG4- 
S-based  HeLa  cell  cDNA  library  were  transformed  into  the  yeast  strain  EGY48 
(29).  The  transformants  were  selected  on  Gal-Raf-X-Gal  (5-bromo-4-chloro-3- 
indolyl-p-D-galactopyranoside)  medium  in  the  absence  of  leucine  and  were  fur¬ 
ther  screened  for  the  expression  of  LacZ  in  the  presence  of  1  pM  T3.  Blue 
colonies  were  picked  and  reexamined  for  T3-dependent  expression  of  LacZ. 
Positive  yeast  clones  were  then  selected,  and  plasmids  harboring  candidate  prey 
cDNAs  were  isolated.  An  individual  candidate  prey  plasmid  was  then  amplified 
in  Escherichia  coli  and  retransformed  into  the  original  yeast  strain  to  confirm  the 
interaction  phenotype.  The  cDNA  inserts  were  then  sequenced  with  an  auto¬ 
matic  sequencer.  Four  novel  clones  (NRIFl,  -2,  -3,  and  -4)  were  obtained. 
Among  them,  NRIF3  was  a  full-length  clone. 

Wild-type  NRIF3,  the  B3-endonexin  long  form  (EnL)  and  short  form  (EnS), 
and  the  L9A  NRIF3  mutant  protein  were  examined  for  their  interaction  with 
various  nuclear  receptors  in  a  yeast  two-hybrid  assay.  The  following  receptor 
baits  were  used:  the  LexA-cTRo  LBD,  LexA-human  TRp  (hTRp)  LBD,  LexA- 
hRARa  LBD,  LexA-hRXRa  LBD,  and  UxA-hGR  LBD.  The  NRIF3  C-termi¬ 
nal  domain  (NCD)  was  fused  in  frame  with  the  LexA  DBD  and  examined  for 
interaction  with  receptor  LBDs  with  the  following  preys:  the  B42-cTRa  LBD, 
B42-hRARa  LBD,  and  B42-hRXRa  LBD  expressed  from  pJG4-S.  Yeast  cells 
harboring  appropriate  plasmids  were  grown  in  selective  media  with  Gal-Raf  in 
the  presence  or  absence  of  cognate  ligand  (1  pM  T3  for  TR,  all  Imns  or  9-cis  RA 
for  RAR,  9-cu  RA  for  RXR,  and  10  pM  deoxycorticosterone  for  GR)  overnight 
before  3-galactosidase  activity  was  assayed  with  o-nitrophenyl  0->galactopyr- 
anoside  as  the  substrate.  p-Galactosidase  units  were  calculated  with  the  formula 
(OD420  ^  l,000)/(minutes  of  incubation  x  ODeo,  of  yeast  suspension),  where 
OD420  and  ODhio  are  the  optical  densities  at  420  and  600  nm,  respectively. 

Fluorescence  microscopy.  Full-length  NRIF3  was  cloned  into  the  green  fluo¬ 
rescent  protein  (GFP)  fusion  protein  expression  vector  pEGFP  (Qontech).  The 
resulting  GFP-NRIF3  vector  and  the  control  plasmid  pEGFP  were  transfected 
into  HeLa  cells  by  calcium  phosphate  coprecipitation.  Cells  were  incubated  at 
37°C  for  24  h  before  the  examination  with  a  fluorescence  microscope  to  deter¬ 
mine  the  subcellular  location  of  GFP-NRIF3  or  the  GFP  control. 

In  vitro  binding  assay.  Full-length  NRIF3  was  cloned  into  pGEX2T,  a  bacte¬ 
rial  glutathione  S-transferase  (GST)  fusion  protein  expression  vector  (Pharma¬ 
cia).  The  GST-NRIF3  fusion  protein  was  expressed  in  E.  coli  and  affinity  purified 
with  glutathione-agarose  beads  (30).  ^’S-labeled  full-length  cTRa,  hRARa, 
hRXRo,  hVDR,  hGR,  hPR,  and  hER  were  generated  by  in  vitro  transcription 
and  translation  with  a  reticulocyte  lysate  system  (Promega).  Binding  was  per¬ 
formed  as  previously  described  (30)  with  the  following  buffer:  20  mM  HEPES 
(pH  7.9)-l  mM  MgQj-l  mM  dithiothreitol-10%  glycerol-0.05%  Triton 
X-IOO-1  pM  ZnQj-ISO  mM  KCI.  Appropriate  ligands  were  added  into  the 
binding  reaction  mixture  where  indicated  in  the  figures  in  the  following  concen¬ 
trations:  1  pM  T3  for  TR,  1  pM  all-tmnj  RA  or  9-cis  RA  for  RAR,  I  pM  9-cis 
RA  for  RXR,  and  150  nM  l,25-(OH)2-vitamin  Dj,  dexamethasone,  progester¬ 
one,  or  estradiol  for  VDR,  GR,  PR,  or  ER,  respectively.  After  the  binding 
reaction,  the  beads  were  washed  three  times  and  the  labeled  receptors  bound  to 
the  beads  were  examined  by  sodium  dodecyl  sulfate-10%  polyacrylamide  gel 
electrophoresis  followed  by  autoradiography.  Five  percent  of  the  ”S-labeled 
receptor  input  was  also  electrophoresed  in  the  same  gel. 

Transfection  studies.  Most  reporters  used  in  this  study,  including  IR-AMTV- 
CAT,  DR4-4MTV-CAT,  GH-TRE-tk-CAT,  and  lR+3  (ERE)-AMTV-CAT, 
have  been  described  previously  (5,  25,  78).  A  DRl-AMTV-CAT  reporter  re¬ 
sponsive  to  RXR  was  obtained  from  Ron  Evans.  A  GRE/PRE-tk-CAT  reporter 
was  obtained  from  Gunther  Schutz.  (IR)2-TATA-CAT  was  constructed  in  our 
laboratory  by  cloning  two  copies  of  the  inverted-repeat  (IR)  sequence  (AGG 
TCA  TGACCT)  upstream  of  a  TATA  element  derived  from  the  thymidine 
kinase  (tk)  promoter.  An  hVDR  expression  vector  and  VDRE-AMlV-CAT 
containing  the  VDRE  from  the  osteocalcin  promoter  were  obtained  from  J. 
Wesley  Pike.  Vectors  expressing  cTRa,  hRARa,  hRXRa,  rat  GR  (rGR),  hPR, 
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and  hER  have  been  described  previously  (17,  25,  26,  50,  53,  81).  The  NRIF3 
expression  vector  was  constructed  by  cloning  hill-length  NRIF3  into  a  pExpress 
vector  (25).  Appropriate  plasmids  were  transfected  into  HeLa  cells  by  calcium 
phosphate  coprecipitation  with  25  to  100  ng  of  the  receptors,  250  to  500  ng  of  the 
chloramphenicol  acetytransferase  (CAT)  reporters,  and  750  ng  of  the  NRIF3  or 
control  pExpress  vector.  After  transfection,  cells  were  incubated  at  37°C  (with  or 
without  cognate  ligands)  for  42  h  before  being  harvested.  CAT  assays  were 
carried  out  as  previously  described  (30).  Relative  CAT  activity  was  determined 
as  the  percent  acetylation  of  the  substrate  per  30  pg  of  cell  protein  in  a  IS-h 
incubation  at  37°C.  The  results  were  calculated  from  duplicate  or  quadruplicate 
samples,  and  the  variation  among  samples  was  less  than  10%. 

Domain  and  mutagenesis  analyses.  To  construct  pJG4-5-derived  vectors  ex¬ 
pressing  EnL  or  EnS,  the  pJG4-5/NRIF3  plasmid  was  digested  with  Afcol  and 
Xhol  and  the  resulting  vector  fragment  was  gel  purified.  This  fragment  was  then 
ligated  to  an  EnL  or  EnS  insert  generated  from  pExpress-EnL  or  pExpress-EnS 
by  NcoVSall  double  digestion.  The  resulting  pJG4-5/EnL  or  pJG4-5/EnS  plasmid 
was  confirmed  by  sequence  analysis.  The  L9A  mutant  form  of  NRIF3  was 
generated  by  site-directed  mutagenesis  by  a  PCR-based  method,  and  the  muta¬ 
tion  was  confirmed  by  sequence  analysis.  ^G4-5-derived  vectors  expressing  EnL, 
EnS,  or  the  L9A  NRIF3  mutant  form  were  transformed  into  yeast  strains  har¬ 
boring  the  LacZ  reporter  (pSH18-34)  and  appropriate  bait  plasmids  (LexA-TR, 
LexA-RAR,  LexA-RXR,  and  LexA-GR).  Transformants  were  subjected  to 
quantitative  assays  of  ^-galactosidase  activity  as  described  above. 

To  construct  the  bait  plasmid  expressing  LexA-NCD,  a  derivative  of  pEG202 
(which  contains  a  new  polylinker)  was  digested  with  Ncol  and  Xhol  and  ligated 
to  synthetic  oligonucleotides  that  encode  the  last  16  amino  acids  of  NR1F3 
(residues  162  to  177).  Similarly,  the  mutant  NCD  was  generated  by  using  oligo¬ 
nucleotides  that  contain  the  designed  mutations  in  the  ligation  reaction.  Both 
constructs  were  confirmed  by  sequence  analysis.  Bait  plasmids  expressing  LexA- 
NCD  or  LexA-mutant  NCD  were  transformed  together  with  one  of  the  following 
prey  plasmids,  B42-TR  LBD,  B42-RXR  LBD,  or  B42-RAR  LBD,  into  the  yeast 
strain  that  harbors  the  LacZ  reporter  (pSH18-34).  Subsequent  two-hybrid  assays 
were  carried  out  as  described  above. 

Docking  of  coactivator  peptides  to  receptors.  We  built  a  model  of  the  inter¬ 
action  between  the  17-residue  C-terminal  peptide  of  NRIF3  (KASRHLDSYE 
FLKAILNI  and  the  LBDs  of  several  receptors  (TRa  was  used  as  an  example  in 
the  experiment  reflected  in  Fig.  10).  An  LXXIL  motif  within  the  NRIF3  peptide 
is  underlined.  A  similar  modeling  procedure  was  carried  out  on  a  20-residue 
peptide  (SLTERHKILHRLLQEGSPSD)  of  the  second  LXXLL  box  of  SRC-1 
(52).  We  hypothesized  that  the  LXXIL  motif  of  the  C  terminus  of  NRIF3 
contacts  the  coactivator  binding  site  of  the  nuclear  receptors,  and  the  automatic 
docking  procedure  was  carried  out  towards  this  site  (71,  75,  76).  TVvo  critical 
features  of  the  interaction  between  the  LBDs  of  nuclear  hormone  receptors  and 
their  coactivators  were  used  to  build  the  models,  (i)  One  was  the  “charge  clamp,” 
initially  observed  in  the  complex  between  SRC-1  and  PPAR-y  (56),  where  a 
conserved  glutamate  and  lysine  at  opposite  ends  of  the  hydrophobic  cavity  of  the 
receptors  contact  the  backbone  of  the  coactivator’s  LXXLL  box.  This  feature 
enabled  us  to  orient  the  NRIF3  helical  peptide,  (ii)  The  other  feature  was  that 
the  leucines  of  the  LXXLL  motif  of  SRC-1  are  buried  in  the  hydrophobic  cavity 
of  the  receptor.  This  feature  allowed  us  to  predict  the  side  of  the  NRIF3  peptide 
which  faces  the  receptor. 

The  coactivator  peptides  were  assigned  a  heiical  secondary  structure,  the 
backbone  <|>  and  i|i  angles  being  —62  and  —41  degrees,  respectively.  The  oi  angle 
was  set  to  180  degrees.  Loose  distance  restraints  were  set  between  the  charge 
clamp  of  the  receptors  (56)  and  C“  atoms  of  the  peptide.  The  energy  of  the 
complex  was  minimized  in  the  internal  coordinate  space  by  using  the  modified 
ECEPP/3  potentials.  The  subset  of  the  variables  minimized  by  the  ICM  method 
(1, 71, 76)  included  the  side  chains  of  the  receptor,  six  positionai  variables  of  the 
helix,  and  the  side  chain  torsion  angles  of  the  helix. 

Binding  energy  calculation.  The  binding  energy  was  calculated  by  the  parti¬ 
tioning  method  as  described  elsewhere  (64).  Briefly,  the  binding  energy  function 
is  partitioned  into  three  terms:  the  surface  (or  hydrophobic)  term,  determined  as 
the  product  of  the  solvent-accessible  surface  by  a  surface  tension  of  30  cal/mol/ 
A^;  the  electrostatic  term,  calculated  by  a  boundary  element  algorithm,  with  a 
dielectric  constant  of  8;  and  the  entropic  term,  which  results  from  the  decrease 
in  conformational  freedom  of  residue  side  chains  partially  or  completely  buried 
upon  complexation. 


RESULTS 

Cloning  of  NRIF3  cDNA.  To  isolate  potential  coactivators 
mediating  the  transcriptional  activation  functions  of  nuclear 
receptors,  we  employed  a  yeast  two-hybrid  screening  strategy 
(29).  A  bait  expressing  a  full-length  TRa  fused  to  the  C  ter¬ 
minus  of  the  LexA  DBD  was  used  to  screen  a  HeLa  cell  cDNA 
library  cloned  into  pJG4-5  (29).  Candidate  clones  that  exhib¬ 
ited  a  thyroid  hormone  (T3)-dependent  interaction  with  LexA- 
TRa  were  selected  and  further  examined  and  sequenced.  Four 
novel  clones  were  identified,  and  all  were  found  to  exhibit 
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FIG.  1.  Hormone-dependent  interaction  of  NRIF3  with  the  LBD  of  TR. 
Induction  of  3-galactosidase  activity  by  thyroid  hormone  (T3)  was  measured  in 
the  yeast  strain  EGY48  transformed  with  a  bait  vector  expressing  the  LexA- 
cTRa  LBD  and  a  prey  plasmid  expressing  NRIF3  fused  to  the  B42  activation 
domain  (29).  The  bait  I^xA  alone  was  used  as  the  negative  control.  The  prey 
B42-Tripl  was  used  as  the  positive  control.  Hatched  bars,  without  T3;  filled  bars, 
with  1  pM  T3. 


levels  of  interaction  with  the  LBD  of  TRa  similar  to  the  levels 
they  exhibited  with  the  full-length  receptor  (data  not  shown). 
These  clones  were  designated  NRIFl,  -2,  -3,  and  -4.  Not  sur¬ 
prisingly,  the  LBD  of  TRp  was  also  found  to  interact  with  these 
NRIFs  in  a  T3-dependent  manner  (data  not  shown).  Among 
these  four  isolated  NRIFs,  NRIF3  was  a  full-length  clone.  As 
shown  in  Fig.  1,  LexA  alone  (negative  control)  did  not  interact 
with  NRIF3  (as  indicated  by  the  low  p-galactosidase  activity) 
and  incubation  with  T3  had  no  effect.  Similarly,  no  interaction 
was  detected  between  the  LexA-TR  LBD  and  B42  alone  with 
or  without  T3  (data  not  shown).  The  LexA-TR  LBD  also 
showed  little  interaction  with  NRIF3  in  the  absence  of  T3. 
However,  incubation  with  T3  resulted  in  strong  stimulation  of 
the  NRIF3-TR  LBD  interaction  (Fig.  1).  The  extent  of  T3- 
dependent  interaction  between  NF^3  and  the  LexA-TR  LBD 
was  similar  to  that  of  Tripl  (Fig.  1),  one  of  the  first  TR- 
interacting  factors  cloned  in  a  two-hybrid  screen  (42). 

Sequence  analysis  of  NRIF3.  Sequence  analysis  of  the  NRIF3 
cDNA  revealed  a  single  open  reading  frame  encoding  a  poly¬ 
peptide  of  177  amino  acids  (Fig.  2).  NRIF3  has  no  homology 
with  members  of  the  SRC-1  and  CBP/p300  families.  The  size 
of  NRIF3  is  in  sharp  contrast  to  the  size  of  CBP/p300  (around 
300  kDa)  or  of  SRC-1  family  members  (around  160  kDa). 
NRIF3  contains  a  putative  nuclear  localization  signal  (KRKK), 
as  well  as  one  copy  of  an  LXXLL  motif  (amino  acids  9  to  13) 
that  was  recently  identified  as  being  essential  for  the  interac¬ 
tion  of  a  number  of  putative  coactivators  with  nuclear  recep¬ 
tors  (32). 

A  database  search  identified  two  highly  related  homologs  of 
NRIF3,  which  were  previously  designated  p3-endonexin  short 
and  long  forms  (67).  The  endonexin  short  form  (EnS)  was 
originally  isolated  from  a  two-hybrid  screen  intended  to  clone 
factors  that  interact  with  the  cytoplasmic  tail  of  integrin  ^3 
(67).  The  long  form  (EnL)  was  then  identified  as  an  alterna¬ 
tively  spliced  product  of  the  same  gene.  However,  the  long 
form  does  not  bind  to  integrin  p3  (67).  Nucleotide  sequence 
comparisons  between  cDNAs  of  NRIF3  and  EnS  or  EnL  in¬ 
dicate  that  NRIF3  is  a  third  alternatively  spliced  product  of  the 
same  gene  (alignment  not  shown).  The  precise  function(s)  of 
the  two  endonexin  proteins  is  under  investigation  (reference 
66a  and  see  Discussion). 
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1  CAGCGGCAGTGCTGaTTCCCGAATCTCACAATGCCTCTTAAAAGATCACTGMGnCGA 

MPVKRSLK  I.  D  U 


61  TGGTCTGnAGAAGAAAATTCATTTGATCCTTCAAAAATCACAAGGAAGAAAAGTGTTAT 

G  I  L  EENSFDP5KITRKKSVI  It 

121  AACTTATTCTCCAACAACTGGAACTTGTCAAATGACTCTATTTGCTTaCCaCAAGnC 

TYSPTTGTCQMSLFASPTSS  5» 

181  TGAAGAGCAAAAGCAaGAAATGGACTATCAAATGAAAAGAGAAAAAAATTGAATCACCC 

EEQKHRNCLSNE  K  R  K  K  L  N  H  P  n 

241  CAGrnAACTGAAAGCAAAaATCTACAACAAAAGACAATGATGAATTCATGATGnGCT 

slteskesttkdndefmmll  m 


301  ATCAAAAGTTGAGAAATTGTCAGAAGAAATCATGGAGATAATGCAAAATnAAGTACTAT 

SKVEKLSEEIMEIHQNLSSI  lit 

361  AaGGCrrTCGAGGGCAGTAGAGAGCTTGAAAATCTCATTGGAATCTCCTGTGCATCACA 

QALEGSRELENLIGISCASH  131 

421  TTTaTAAAAAGAGAAATGCAGAAAACCAAAGAACTAATGACAAAAGTGAATAAACAAAA 

FLKREHQKTKELMTKVNKQK  154 

481  ACTGTTTGAAAAGAGTACAGGACnCCTCACAAAGCArCACGTCATCTrGACAGCTATGA 

LFEKSTGLPHKA  S  R  H1.  0  S  Y  .E  IM 


541  ATTCCnAAAGCCATmAAAaGAGGCATTAAGAAGAAATGaCTCACCATGAGCACCA 
. . N  • 


FIG.  2.  Nucleotide  and  deduced  amino  acid  sequences  of  NRIF3.  Only  part 
of  the  cDNA  sequence  is  shown.  A  putative  nuclear  localization  signal  (KRKK) 
is  underlined,  llie  putative  LXXLL  motif  is  shown  with  a  double  underline. 
NRIF3  and  EnL  have  95%  identity.  They  differ  only  in  their  C  termini,  where  the 
last  16  amino  acids  (dotted  underline)  in  NRIF3  are  replaced  with  9  different 
amino  acids  (GQPQMSQPL)  in  EnL.  EnS  consists  of  111  amino  acids  and  is 
100%  identical  to  the  first  111  amino  acids  of  NRIF3  or  EnL. 


NRIFS  localizes  to  the  cell  nucleus.  Although  a  putative 
nuclear  localization  signal  was  found  in  NRIF3,  we  considered 
it  important  to  identify  the  subcellular  location  of  the  NRIF3 
protein  since  extensive  homology  was  found  between  NRIF3 
and  the  two  endonexins.  The  entire  NRIF3  open  reading 
frame  was  fused  to  the  C  terminus  of  GFP  (18).  The  resulting 
GFP-NRIF3  fusion  protein  was  expressed  in  HeLa  cells  by 
transient  transfection,  and  the  subcellular  location  of  the  fu¬ 
sion  protein  was  visualized  by  fluorescence  microscopy.  As 
shown  in  Fig.  3,  the  control  GFP  protein  distributed  through¬ 
out  the  cell  while  GFP-NRIF3  localized  exclusively  to  the 
nucleus.  This  result  suggests  that  NRIF3  is  a  nuclear  protein, 
which  is  compatible  with  its  putative  role  as  a  nuclear  receptor 
coactivator. 

Selective  interaction  of  NRIF3  with  liganded  nuclear  recep¬ 
tors  in  yeast.  Although  NRIF3  was  originally  cloned  with  full- 
length  TRa  as  the  bait,  we  later  identified  that  the  region  of 
the  receptor  responsible  for  NRIF3  binding  is  its  LED  (Fig.  1). 
A  common  feature  among  most  of  the  known  coactivators  that 
show  ligand-dependent  interaction  with  nuclear  receptors  is 
the  presence  of  the  LXXLL  motif(s)  in  their  receptor  interac¬ 
tion  domains.  The  LXXLL  motif  appears  to  be  involved  in 
direct  contact  with  a  structurally  conserved  surface  in  the  li¬ 
gand-bound  LEDs  of  the  receptors  (23),  which  may  provide 


GFP  GFP-NRIF3 


FIG.  3.  NRIF3  is  a  nuclear  protein.  HeLa  cells  were  transfected  with  an 
expression  vector  for  GFP  (left  panel)  or  GFP-NRIF3  (right  panel).  The  cellular 
location  of  the  expressed  proteins  was  visualized  by  fluorescence  microscopy. 


TABLE  1.  Interaction  of  NRIF3  with  nuclear  receptors  in  yeast” 


Bait 

Prey 

^-Galactosidase  activity 

Without 

ligand 

With 

ligand 

Fold 

stimulation 

LexA 

NRIF3-B42 

2.3 

1.9 

0.8 

LexA-TR 

NR1F3-B42 

1.8 

125 

69 

LexA-RAR 

NR1F3-B42 

0.1 

0.1 

1 

LexA-RXR 

NRIF3-B42 

0.2 

63 

315 

LexA-GR 

NRIF3-B42 

0.8 

0.6 

0.8 

”  The  LacZ  reporter  activities  were  determined  for  yeast  strains  harboring  the 
indicated  bait  and  prey  plasmids  in  the  presence  or  absence  of  cognate  ligands  as 
described  in  Materials  and  Methods.  &e  the  text  for  detailed  explanations. 


the  molecular  basis  for  the  broad  spectrum  of  receptor  binding 
by  coactivators  such  as  SRC-1  and  GRIPl.  Since  a  putative 
LXXLL  motif  is  also  present  in  NRIF3  (amino  acids  9  to  13), 
we  asked  whether  NRIF3  also  interacts  with  the  LEDs  of  other 
nuclear  receptors. 

The  LEDs  of  several  nuclear  receptors  were  examined  for 
interaction  with  NRIF3  in  a  yeast  two-hybrid  assay.  As  shown 
in  Table  1,  NRIF3  does  not  interact  with  LexA  alone  (negative 
control)  with  or  without  ligand.  LexA-TR  and  LexA-RXR 
showed  little  (if  any)  interaction  with  NRIF3  in  the  absence  of 
their  cognate  ligands.  However,  the  presence  of  T3  (for  TR)  or 
9-cis  RA  (for  RXR)  resulted  in  a  strong  stimulation  of  their 
interaction  with  NR1F3,  as  indicated  by  the  induction  of  p-ga- 
lactosidase  activity  (Table  1).  Interestingly,  when  LexA-RAR 
or  LexA-GR  was  used  as  the  bait,  no  interaction  was  detected 
with  NRIF3  in  the  presence  or  absence  of  their  cognate  ligands 
(Table  1).  The  finding  that  NRIF3  interacts  with  TR  but  not 
RAR  was  surprising  in  light  of  a  recent  study  which  showed 
that  TR  and  RAR  functionally  interact  with  the  same  LXXLL 
boxes  (boxes  2  and  3)  of  SRC-l/NCoA-1  (52).  As  positive 
controls,  we  confirmed  that  both  LexA-RAR  and  LexA-GR 
exhibited  ligand-dependent  interaction  with  other  coactivators 
that  are  not  receptor  specific  (data  not  shown).  Taken  to¬ 
gether,  these  results  suggest  that  NRIF3  exhibits  differential 
specificities  in  its  interactions  with  different  nuclear  receptors. 

NRIF3  specifically  binds  to  TR  and  RXR  but  not  to  other 
nuclear  receptors  in  vitro.  To  further  examine  the  interaction 
between  NRIF3  and  various  nuclear  receptors  as  well  as  to 
confirm  the  potential  receptor  specificity  of  NRIF3,  in  vitro 
GST  binding  assays  were  performed  (30).  ^*S-labeled  nuclear 
receptor,  generated  by  in  vitro  transcription  and  translation, 
was  incubated  with  purified  GST-NRIF3  or  the  GST  control 
bound  to  glutathione-agarose  beads.  All  binding  assays  were 
carried  out  with  or  without  the  cognate  ligand  of  the  examined 
receptor.  As  shown  in  Fig.  4  (top  left),  TR  and  NRIF3  interact 
poorly  in  the  absence  of  T3.  Addition  of  T3  resulted  in  a  strong 
increase  in  TR  binding  to  GST-NRIF3,  confirming  that  NRIF3 
associates  with  TR  in  a  T3-dependent  manner.  Using  similar 
binding  assays,  we  also  studied  the  interaction  of  NRIF3  with 
six  other  nuclear  receptors.  Consistent  with  our  findings  from 
the  yeast  two-hybrid  experiments  (Table  1),  NRIF3  interacted 
with  RXR  in  vitro  in  a  ligand-dependent  manner  (Fig.  4)  but 
showed  little  or  no  binding  to  other  nuclear  receptors  (RAR, 
VDR,  GR,  PR,  and  ER)  in  the  presence  or  absence  of  their 
cognate  ligands  (Fig.  4).  Taken  together,  the  results  of  the 
yeast  two-hybrid  (Table  1)  and  the  in  vitro  binding  (Fig.  4) 
assays  suggest  that  NRIF3  possesses  a  distinct  receptor  speci¬ 
ficity. 

NRIF3  selectively  potentiates  TR-  and  RXR-mediated  trans¬ 
activation  In  vivo.  To  examine  the  potential  role  of  NRIF3  in 
TR-mediated  transactivation,  transfection  studies  were  carried 
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FIG.  4.  Characterization  of  the  NRIF3  interaction  with  nuclear  receptors  in  vitro.  A  “S-labeled  full-length  receptor  (cTRa,  hRARa,  hRXRa,  hVDR,  hPR,  hGR, 
or  hER)  was  incubated  with  an  affinity-purified  GST  control  or  GST-NRIR  linked  to  glutathione-agarose  beads.  The  binding  was  performed  in  the  absence  (— )  or 
presence  (-f)  of  cognate  ligands  as  described  in  Materials  and  Methods.  After  incubation  and  washing,  the  bound  receptors  were  analyzed  by  sodium  dodecyl 
sulfate-10%  polyacrylamide  gel  electrophoresis  and  detected  by  autoradiography.  The  input  lane  in  each  binding  assay  represents  5%  of  the  total  ’*S-labeled  receptor 
used  in  each  incubation.  GST-RXR  was  used  as  a  positive  control  for  RAR  binding. 


out.  HeLa  cells,  which  lack  endogenous  TR  (25),  were  trans¬ 
fected  with  a  vector  expressing  TR  and  a  CAT  reporter  under 
the  control  of  the  AMTV  basal  promoter  linked  to  an  idealized 
IR  (AGGTCATGACCT)  TRE  sequence  (IR-AMTV-CAT) 
(25),  along  with  either  a  control  plasmid  or  a  vector  expressing 
NRff3.  As  shown  in  Fig.  5A,  NRIF3  significantly  enhances 
TR-mediated  activation  of  the  CAT  reporter  (typically  2.5-  to 
3-fold).  As  a  control,  we  also  examined  the  effect  of  CBP,  a 
reported  coactivator  for  nuclear  receptors  (13,  37),  and  found 
that  its  expression  results  in  a  degree  of  enhancement  similar 
to  that  with  NRIF3  (around  threefold)  (Fig.  5A). 

We  also  examined  another  CAT  reporter  controlled  by  the 
herpesvirus  tk  promoter  linked  to  native  rat  growth  hormone 
TRE  sequences  (5).  NRIF3  was  found  to  also  enhance  TR- 
mediated  activation  of  this  reporter  (about  3.5-fold)  (Fig.  5B). 


In  addition,  using  similar  transfection  assays,  we  found  that 
NRIF3  enhances  TR-mediated  activation  of  two  other  report¬ 
ers,  (IR)2-TATA-CAT  and  DR4-AMTV-CAT  (data  not  shown). 
Therefore,  NRIF3  potentiates  TR-mediated  transactivation  in 
a  variety  of  different  TRE  and  promoter  contexts.  Taken  to¬ 
gether,  the  results  of  these  transfection  studies  suggest  that 
NRIF3  can  function  as  a  coactivator  of  TR. 

To  examine  whether  NR1F3  can  also  act  as  a  coactivator  for 
RXR,  HeLa  cells  were  transfected  with  the  IR-AMTV-CAT 
reporter,  whose  IR  sequence  can  also  function  as  a  strong 
response  element  for  the  RXR(s)  and  RAR(s)  (25,  49,  61). 
HeLa  cells  express  the  endogenous  RXR(s)  and  RAR(s),  as 
the  activity  of  the  IR-AMTV-CAT  reporter  was  strongly  stim¬ 
ulated  by  their  cognate  ligands,  even  without  cotransfection  of 
any  receptor  expression  plasmid  (Fig.  6A,  bars  1,  3,  and  5). 


A 


I  IH  □-C  tMTV-CAT  1 


B 


I  GHTBE  I - 1  tk-CAT  ~1 


FIG.  5.  NR1F3  enhances  TR-mediated  transactivation  in  vivo.  HeLa  celts  were  transfected  with  a  vector  expressing  cTRa  and  the  IR-AMTV-CAT  reporter  (A) 
or  the  GH-TRE-tk-CAT  reporter  (B)  in  the  presence  (filled  bars)  or  the  absence  (hatched  bars)  of  1  p.M  T3.  The  vector  expressing  NRIF3  or  the  empty  control  vector 
was  cotransfected  to  examine  the  effect  of  NRIF3  on  TR-mediated  activation.  In  panel  A,  the  effect  of  CBP  was  compared  to  that  of  NRIF3.  GH,  growth  hormone. 
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FIG.  6.  NRIF3  functions  as  a  coactivator  for  RXR  but  not  RAR.  (A)  NRIF3  potentiates  the  activity  of  the  endogenous  RXR(s)  but  not  the  RAR(s).  HeLa  cells 
were  transfected  with  the  IR-AMTV-CAT  reporter  (without  any  receptor  expression  vector)  to  examine  the  activation  by  endogenous  retinoid  receptors.  The  NR1F3 
expression  vector  or  the  empty  control  vector  was  cotransfected  to  examine  the  effect  of  NRIF3  on  the  activity  of  the  endogenous  RXR(s)  or  RAR(s).  Relative  CAT 
activity  was  determined  in  the  presence  (filled  bars)  or  absence  (hatched  bars)  of  the  indicated  ligands  (1  pM).  (B  and  C)  NRIF3  potentiates  the  activity  of  the 
exogenously  expressed  RXR.  A  vector  expressing  hRXRa  was  cotransfected  into  HeLa  cells  with  the  IR-AMlV-CAT  reporter  (B)  or  the  DRl-AMTV-CAT  reporter 
(C)  in  the  presence  (filled  bars)  or  absence  (hatched  bars)  of  the  indicated  ligands  (I  (sM).  The  effect  of  NRIF3  on  RXR-mediated  transactivation  was  examined  as 
described  for  panel  A.  TTNPB,  a  synthetic  ligand  for  RAR. 


Cotransfection  of  NRIF3  enhanced  the  activation  of  this  re¬ 
porter  by  either  9<is  RA,  or  LG100153  (72),  an  RXR-specific 
ligand  (Pig.  6A,  bars  1  and  2  and  bars  3  and  4).  In  contrast, 
although  the  RAR-specific  ligand  TTNPB  (68)  also  activated 
the  IR-AMTV-CAT  reporter,  cotransfection  of  NRIF3  had 
no  effect  (Fig.  6A,  bars  5  and  6).  These  results  indicate  that 
NRIF3  potentiates  the  activity  of  the  endogenous  RXR(s)  but 
not  the  RAR(s),  which  is  consistent  with  the  distinct  receptor 
specificity  of  NRIF3  revealed  from  the  yeast  two-hybrid  assay 
(Table  1)  and  in  vitro  binding  experiments  (Fig.  4). 

To  further  document  that  NRiF3  can  function  as  a  coacti¬ 
vator  for  RXR,  a  vector  expressing  exogenous  RXR  was  co¬ 
transfected  with  IR-AMTV-CAT.  Exogenous  RXR  expression 
enhanced  the  activation  of  this  CAT  reporter  by  either  9-cis 
RA  or  LG100153  (compare  Fig.  6A  and  B,  bars  1  and  3).  This 
RXR-mediated  activation  of  repwiter  expression  was  further 
stimulated  by  NRIF3  (Fig.  6B).  Finally,  we  also  examined  the 
activation  of  a  DRl-AMTV-CAT  reporter.  This  DRl  (AGGT 
CANAGGTCA  [where  N  is  any  nucleotide])  sequence  is 
thought  to  be  a  specific  response  element  for  RXR  (39, 51). 
Although  we  found  that  this  DRl  is  a  weaker  response  ele¬ 
ment  than  the  IR  sequence,  cotransfection  of  an  RXR  expres¬ 
sion  vector  led  to  ligand-induced  activation  of  this  DRl  re¬ 
porter,  which  was  also  further  enhanced  by  NRIF3  (Fig.  6C). 

NRIF3  does  not  potentiate  the  activities  of  GR,  PR,  ER,  and 
VDR  in  vivo.  The  seiective  coactivation  of  TR  and  RXR  (but 
not  RAR)  by  NRIF3  is  consistent  with  its  distinct  binding 
specificities  to  these  receptors.  To  further  establish  that  NRIF3 
acts  as  a  receptor-specific  coactivator,  we  next  examined  the 
effect  of  NRIF3  on  the  activities  of  four  additional  nuclear 
receptors,  including  GR,  PR,  ER,  and  VDR,  by  transfection 
experiments.  HeLa  cells  were  transfected  with  a  GRE/PRE- 
tk-CAT  reporter  along  with  a  vector  expressing  either  GR  or 
PR.  As  shown  in  Fig.  7A,  cognate  hormone  treatment  results 
in  activation  of  the  CAT  reporter.  However,  expression  of 
NRIF3  has  little  effect  (Fig.  7A).  Similar  experiments  were 
carried  out  with  ER  and  ERE-AMTV-CAT  or  VDR  and 
VDRE-AMTV-CAT.  As  shown  in  Fig.  7B  and  C,  NRIF3  was 


found  to  have  little  or  no  effect  on  the  activities  of  these 
receptors  as  well.  Taken  together,  the  combined  results  of  our 
transfection  studies  support  the  notion  that  NRIF3  is  a  coac¬ 
tivator  with  a  unique  receptor  specificity. 

A  novel  C-terminal  domain  in  NR1E3  is  essentiai  for  iigand- 
dependent  interactions  with  TR  and  RXR.  The  LXXLL  sig¬ 
nature  motif  has  been  found  to  be  present  in  the  receptor- 
interacting  domains  of  many  identified  coactivators,  such  as 
SRC-l/NCoA-1  and  GRIPl/nF-2  (32).  The  broad  spectrum 
of  receptor  binding  by  coactivators  such  as  SRC-1  suggests  that 
the  LXXLL-containing  interacting  domain  may  recognize 
structurally  similar  surfaces  of  these  LBDs.  Indeed,  recent 
structural  and  functional  studies  revealed  that  the  LXXLL 
motif  and  its  nearby  flanking  amino  acids  are  involved  in  direct 
contact  with  a  hydrophobic  cleft  of  the  target  surfaces  pre¬ 
sented  by  the  ligand-bound  LBDs  of  nuclear  receptors  (19, 23, 
52,  56).  The  facts  that  NRIF3  also  contains  an  LXXLL  motif 
(amino  acids  9  to  13)  (Fig.  2  and  8A)  and  exhibits  a  distinct 
receptor  specificity  raise  the  possibility  that  (i)  the  motif  and 
surrounding  amino  acids  are  involved  in  mediating  receptor- 
specific  interaction  of  NRIF3  or  (ii)  another  region  of  NRIF3 
(alone  or  in  concert  with  the  LXXLL  motif  region)  plays  an 
important  role  in  mediating  such  an  interaction. 

To  explore  these  issues,  we  examined  whether  EnS  and  EnL, 
which  contain  the  same  LXXLL  motif  and  flanking  amino 
acids  as  NRIR,  can  interact  with  nuclear  receptors  in  a  yeast 
two-hybrid  assay  (Fig.  8).  EnS  consists  of  111  amino  acids  and 
is  100%  identical  to  the  first  111  residues  of  NRIF3,  while  EnL 
consists  of  170  amino  acids,  the  first  161  of  which  are  also 
100%  identical  to  the  same  region  in  NRJF3  (Fig.  2  legend  and 
Fig.  8A).  Thus,  NRIF3  and  EnL  differ  only  in  their  C  termini, 
with  NRIF3  having  a  unique  region  of  16  amino  acids  and  EnL 
having  a  unique  region  of  9  amino  acids  (Fig.  8A).  Interest¬ 
ingly,  despite  their  extensive  identity  with  NRIF3,  the  interac¬ 
tion  with  liganded  TR  or  RXR  is  completely  abolished  in  EnS 
and  EnL  (Fig.  8B).  We  also  examined  other  nuclear  receptors 
that  do  not  interact  with  NRIF3  and  found  that  they  also  do 
not  interact  with  EnS  or  EnL  (data  not  shown).  These  results 
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FIG.  7.  NRIF3  does  not  potentiate  the  activity  of  GR,  PR,  ER,  or  VDR.  HeLa  cells  were  transfected  with  the  following  CAT  reporters  and  appropriate  receptor 
expression  vectors:  GRE/PRE-tk-CAT  and  rGR  or  hPR  (A),  ERE-AMTV-CAT  and  hER  (B),  and  VDRE-AMTV-CAT  and  hVDR  (C).  Cells  were  incubated  in  the 
presence  (filled  bars)  or  absence  (hatched  bars)  of  100  nM  dexamethasone  for  GR,  progesterone  for  PR,  estradiol  for  ER,  and  l,25-(OH)2-vitamin  Da  for  VDR. 
Cotransfection  of  NRIP3  was  found  to  have  little  effect  on  the  activities  of  these  re^ptors. 


indicate  that  the  unique  C-terminal  domain  in  NRIF3  (NCD) 
(residues  162  to  177)  is  essential  for  its  specific  interaction  with 
liganded  TR  and  EIXR  while  the  N-terminal  LXXLL  motif 
(amino  acids  9  to  13)  and  its  flanking  sequences  are  not  suffi¬ 
cient  to  allow  for  detectable  receptor  interactions. 

Although  the  LXXLL  motif  was  found  to  be  insufficient  for 
interaction,  we  examined  whether  this  N-terminal  motif  of 
NRIF3  contributes  to  the  NRIF3-receptor  interaction  by 
mutating  the  first  leucine  of  the  LXXLL  motif  into  alanine 
(L9A)  by  site-directed  mutagenesis.  Previous  experiments 
have  shown  that  the  three  leucine  residues  are  essential  for  an 
LXXLL  module  to  interact  with  receptor  LBDs  and  that  the 
replacement  of  any  of  them  with  alanine  abolishes  the  inter¬ 
action  (32).  We  examined  the  L9A  NRIF3  mutant  form  for  its 
interaction  with  TR  and  RXR  in  a  yeast  two-hybrid  assay.  As 


shown  in  Fig.  9,  the  L9A  mutant  was  still  capable  of  ligand- 
dependent  interaction  with  TR  and  RXR  (~25-fold  induction 
by  ligand).  However,  the  introduced  mutation  reduced  the 
interaction  by  about  4-fold  (for  TR)  or  14-fold  (for  RXR). 
These  results  suggest  that  although  the  LXXLL  motif  is  not 
absolutely  essential  for  NRIF3  interaction  with  liganded  re¬ 
ceptors,  it  plays  a  role  in  allowing  an  optimum  interaction  to 
occur. 

Computer  modeling  suggests  that  the  NCD  docks  into  the 
hydrophobic  cleft  of  the  liganded  LBDs.  Secondary-structure 
analysis  of  the  C-terminal  domain  of  NRIF3  predicted  the 
formation  of  an  a-helix.  Moreover,  inspection  of  the  putative 
C-terminal  helix  revealed  an  LXXIL  motif  (amino  acids  172  to 
176),  which  is  reminiscent  of  the  canonical  LXXLL.  Although 
the  ultimate  elucidation  of  the  molecular  basis  of  the  NRIF3- 


A 


B 


FIG.  8.  The  NCD  is  essential  for  the  interaction  with  liganded  TR  or  RXR.  (A)  Schematic  comparison  of  NR1F3  with  EnS  and  EnL.  EnS  is  100%  identical  to 
the  first  111  amino  acids  of  NR1F3  and  EnL  (open  boxes).  The  regions  from  amino  acids  112  to  161  in  NRIF3  and  EnL  (stippled  boxes)  are  100%  identical.  NRIF3 
and  EnL  differ  in  their  C  termini  (16  amino  acids  in  NRIF3  [hatched  box]  and  9  amino  acids  in  EnL  [filled  box]).  The  positions  of  the  LXXLL  motif  and  a  putative 
nuclear  localization  signal  (KRKK)  are  also  indicated.  (B)  NRIF3  (N),  EnS  (S),  or  EnL  (L)  was  examined  for  interaction  with  LexA-TR  or  LexA-RXR  in  a  yeast 
two-hybrid  assay  as  described  in  Materials  and  Methods.  The  assays  were  performed  in  the  absepce  (hatched  bars)  or  the  presence  (filled  bars)  of  1  pM  T3  (for  TR) 
or9-ciiRA(forRXR). 
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Balts  TR  TR  RXR  RXR 

Preys  WT  L9A  WT  L9A 

FIG.  9.  The  LXXLL  motif  of  NRIF3  is  required  for  optimum  interaction 
with  TR  and  RXR.  Wild-type  NRIF3  (WT)  or  the  L9A  NRIF3  mutant  (L9A) 
was  examined  for  interaction  with  LexA-TR  or  LexA-RXR  in  a  yeast  two-hybrid 
assay  as  described  in  Materials  and  Methods.  p-Galactosidase  activities  were 
determined  in  the  absence  (filled  bars)  or  presence  (stippled  bars)  of  cognate 
ligands  (1  p.M  T3  for  TR;  1  pM  9-cis  RA  for  RXR). 


receptor  interaction  awaits  future  studies  such  as  X-ray  crys¬ 
tallography,  the  putative  helix  structure  of  the  NCD  and  its 
LXXIL  motif  suggest  that  the  NCD  may  interact  with  the 
liganded  LBDs  in  a  fashion  similar  to  that  of  the  receptor¬ 
interacting  domains  that  employ  the  canonical  LXXLL  mo- 
tif(s).  To  explore  this  possibility,  we  modeled  the  interaction  of 
the  C  terminus  of  NRIF3  with  the  liganded  LBDs,  using  algo¬ 
rithms  developed  mainly  by  the  staff  of  the  laboratory  of  one  of 
the  authors  (R.  Abagyan  and  coworkers)  (1,  63,  70,  74,  75). 
The  background  information  and  procedures  used  for  con¬ 
structing  these  models  are  described  in  Materials  and  Meth¬ 
ods.  The  results  of  our  modeling  suggest  that  the  NCD  fits  well 
into  the  hydrophobic  cleft  formed  on  the  LBDs  as  a  result  of 
ligand  binding.  An  example  of  such  a  model  (NCD-TR  LBD) 
is  shown  in  Fig.  10.  In  this  model,  the  two  leucines  and  one 
isoleucine  of  the  LXXIL  motif  are  predicted  to  be  deeply 
buried  in  the  central  cavity  of  the  hydrophobic  groove  formed 
by  the  liganded  LBD  of  the  receptor.  We  also  calculated  the 
putative  binding  energy  for  the  modeled  NCD-TR  complex, 
using  an  improved  partitioning  binding  energy  function,  with 
continuum  representation  of  the  electrostatics  of  the  system 
(64).  The  calculated  binding  energy  for  the  modeled  NCD-TR 
complex  is  about  -21  kcal/mol.  As  a  control,  we  carried  out  a 
similar  modeling  procedure  using  the  second  LXXLL  box 
within  the  receptor-interacting  domain  of  SRC-1.  This  LXXLL 
box  has  been  shown  to  be  required  for  interaction  with  TR 
(52).  Our  calculated  binding  energy  for  this  LXXLL  box  with 
liganded  TR  LBD  is  -18  kcal/mol,  a  value  that  is  very  close  to 
the  one  calculated  for  the  NCD.  Altogether,  our  modeling  and 
calculations  suggest  a  mechanism  in  which  the  NCD  directly 
mediates  interaction  with  liganded  LBDs  through  an  LXXIL 
motif. 

Functional  interaction  of  the  NCD  with  iiganded  LBDs  and 
the  essentiai  roie  of  its  LXXIL  motif.  To  explore  the  possibility 
suggested  from  our  computer  modeling,  the  NCD  (amino  acids 
162  to  177)  was  fused  to  the  LexA  DNA  binding  domain  and 
was  examined  for  interaction  with  the  receptor  LBDs  in  a  yeast 
two-hybrid  assay.  The  LexA-NCD  fusion  protein  alone  does 
not  activate  the  LacZ  reporter  in  yeast  (data  not  shown).  As  a 
negative  control,  we  also  found  that  LexA-NCD  does  not  in¬ 
teract  with  the  B42  activation  domain  itself  (Fig.  11)  and  that 
LexA  alone  does  not  interact  with  the  receptor  LBDs  (data  not 
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FIG.  10.  Hypothetical  model  of  the  interaction  of  the  NCD  and  the  liganded 
LBD.  The  docking  of  the  C-terminal  helix  of  NRIF3,  which  contains  an  LXXIL 
module,  to  the  ligand-bound  LBDs  was  carried  out  as  described  in  Materials  and 
Methods.  The  NCD-TR  LBD  model  is  shown  here  as  an  example.  The  side 
chains  of  the  two  leucines  (green)  and  one  isoleucine  (cyan)  of  the  LXXIL  core 
fit  within  a  hydrophobic  groove  (salmon)  on  the  surface  of  the  liganded  LBD 
(80).  A  similar  modeling  procedure  was  carried  out  with  an  LXXLL  box  of 
SRC-1  (result  not  shown).  Putative  binding  energies  (-21  kcal/mol  for  the  NCD 
and  -18  kcal/mol  for  the  LXXLLboxof  SRC-1)  were  calculated  as  described  in 
Materials  and  Methods.  See  the  text  for  details. 


shown).  However,  when  the  LexA-NCD  and  the  LBD  of  TR  or 
RXR  (fused  with  B42)  were  used  in  the  two-hybrid  assay,  a 
strong  ligand-dependent  interaction  was  observed,  as  indicated 
by  the  induction  of  3-galactosidase  activity  by  their  cognate 


Balt*  WT  WT  WT  WT  Mut  Mut  Mut 

Prey*  B42  TR  RXR  RAR  TR  RXR  RAR 

FIG.  II.  Interaction  of  the  NCD  with  the  receptor  LBDs  and  the  role  of  the 
LXXIL  motif.  The  wild-type  NCD  (WT)  or  the  NCD  mutant  form  (Mut)  in 
which  the  three  core  hydrophobic  residues  of  the  LXXIL  motif  (two  leucines  and 
one  isoleucine)  are  changed  into  alanines  was  examined  for  interaction  with  the 
LBDs  of  TR,  RXR,  and  RAR  in  a  yeast  two-hybrid  assay  as  described  in 
Materials  and  Methods.  p-Galactosidase  activities  were  determined  in  the  ab¬ 
sence  (open  bars)  or  presence  (stippled  bars)  of  cognate  ligands.  The  prey 
expressing  B42  alone  was  used  as  a  negative  control. 
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ligands  (Fig.  11).  These  results  suggest  that  the  NCD  can 
directly  interact  with  the  LBDs  of  TR  and  RXR  in  a  ligand- 
dependent  manner. 

Since  NRIF3  harbors  a  distinct  receptor  specificity  in  inter¬ 
acting  only  with  TR  and  RXR  and  not  other  receptors  (e.g., 
RAR),  we  next  asked  whether  the  NCD  also  harbors  a  recep¬ 
tor  specificity.  To  our  surprise,  the  NCD  was  found  to  interact 
efficiently  with  the  LBD  of  RAR  in  a  ligand-dependent  manner 
(Fig.  11).  Therefore,  while  our  results  clearly  suggest  that  the 
NCD  is  an  important  surface  for  receptor  interactions,  as  the 
NCD  is  found  to  be  both  essential  for  (Fig.  8)  and  sufficient  to 
mediate  (Fig.  11)  such  interactions,  it  nevertheless  does  not 
appear  to  be  (solely)  responsible  for  the  receptor  specificity  of 
NRIF3.  It  is  possible  that  another  region  of  the  NRIF3  mole¬ 
cule  contributes  to  the  observed  receptor  specificity  of  NRIF3 
and/or  that  the  specificity  is  determined  by  the  overall  three- 
dimensional  structure  of  NRIF3. 

Since  our  model  predicts  the  importance  of  the  LXXIL 
motif  in  the  NCD-receptor  interaction  (Fig.  10),  we  tested  this 
by  changing  the  three  core  residues  of  the  motif  (two  leucines 
and  one  isoleucine)  into  alanines.  As  expected,  interaction 
with  the  LBDs  is  completely  abolished  in  the  resulting  mutant 
NCD  (Fig.  11),  confirming  that  the  LXXIL  motif  is  essential 
for  the  interaction. 

DISCUSSION 

Recent  efforts  in  understanding  receptor-mediated  tran¬ 
scription  have  led  to  the  identification  of  a  number  of  co¬ 
activators  for  nuclear  hormone  receptors,  which  can  be  cat¬ 
egorized  into  two  main  groups  based  on  overall  homology, 
the  SRC-1  family  (including  SRC-l/NCoA-1,  TIF2/GRIP1/ 
NCoA-2,  and  AIBl/p/CIP/ACTR/RAC3/TRAM-l)  (2,  14,  34, 
35, 37, 44, 58, 73, 74, 79)  and  the  CBP/p300  family  (13, 31, 37). 
Other  putative  coactivators  (e.g.,  ARA70  and  PGC-1)  that  do 
not  belong  to  the  SRC-1  or  CBP/p300  family  have  also  been 
identified  (60, 85).  In  addition,  p/CAF  may  also  be  involved  in 
receptor  action  through  its  association  with  nuclear  receptors 
as  well  as  with  other  coactivators  (11, 14, 38, 83).  Among  these 
known  coactivators,  CBP/p300,  members  of  the  SRC-1  group, 
and  p/CAF  all  possess  histone  acetyltransferase  activities  (8, 
14,  57,  69,  83). 

In  this  study  we  report  the  identification  of  a  novel  nuclear 
protein  (NRIF3)  which  exhibits  specific  ligand-dependent  in¬ 
teractions  with  TR  and  RXR  but  not  with  RAR,  VDR,  GR, 
PR,  or  ER.  Functional  experiments  indicated  that  NRIF3  po¬ 
tentiates  TR-  and  RXR-mediated  transactivation  in  vivo  but 
exhibits  little  or  no  effect  on  the  activities  of  other  examined 
receptors.  Therefore,  NRIF3  represents  a  novel  coactivator 
with  a  distinct  receptor  specificity  and,  thus,  may  shed  light  on 
clarifying  the  molecular  mechanism(s)  underlying  receptor- 
specific  regulation  of  gene  expression. 

A  database  search  indicated  that  NRIF3  has  no  homology 
with  any  known  coactivators  except  in  a  single  LXXLL  motif. 
An  unusual  feature  of  NRIF3  is  its  relatively  small  size,  which 
is  in  sharp  contrast  to  the  sizes  of  SRC-1  and  CBP/p300.  A 
homology  search  identified  two  alternatively  spliced  isoforms 
of  NRIF3  which  were  previously  designated  p3-endonexin 
short  and  long  forms  (67).  Preliminary  studies  with  these  two 
endonexins  indicate  that,  like  NRIF3,  they  localize  to  the  cell 
nucleus  (43a,  66a).  Interestingly,  despite  their  extensive  iden¬ 
tities  with  NRIF3,  both  EnS  and  EnL  fail  to  exhibit  interaction 
with  liganded  nuclear  receptors  (Fig.  8).  Consistent  with  this 
finding,  we  found  that  EnS  and  EnL  have  little  effect  on  re¬ 
ceptor-mediated  transcription  in  transfection  experiments 
(data  not  shown).  Therefore,  the  precise  roles  of  these  two 
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endonexins  remain  to  be  elucidated.  We  suggest  two  not  mu¬ 
tually  exclusive  possibilities.  First,  since  both  EnL  and  EnS 
appear  to  localize  to  the  nucleus,  it  is  possible  that  they  act  as 
cofactors  for  other  transcriptional  regulators.  Second,  since  the 
EnS  can  interact  with  the  cytoplasmic  tail  of  p3-integrin  (22, 
67),  it  may  communicate  signals  generated  at  the  plasma  mem¬ 
brane  to  the  cell  nucleus.  An  example  of  a  protein  which  is 
involved  in  both  cell  adhesion  and  transcriptional  regulation  is 
P-catenin  (82). 

Previous  study  of  the  endonexins  identified  the  presence  of 
NRIF3-related  mRNAs  (by  Northern  blotting)  in  a  wide  range 
of  human  tissues  (67).  Because  NRIF3  and  EnL  contain  almost 
identical  nucleotide  sequences  and  differ  only  by  an  alternative 
splice  site  which  results  in  the  removal  of  a  small  exon  in 
NRIF3,  it  is  difficult  to  specifically  identify  NRIF3  mRNA  by 
Northern  blotting.  A  search  of  the  expressed  sequence  tag 
database  indicates  that  NRIF3  mRNA,  as  well  as  both  EnL 
and  EnS  mRNAs,  is  expressed.  However,  the  precise  determi¬ 
nation  of  cell  and  tissue  distribution  of  NRIF3,  EnS,  and  EnL 
will  require  the  development  of  highly  selective  antibodies. 
Nevertheless,  the  wide  expression  pattern  of  NRIF3-related 
mRNAs  is  consistent  with  the  role  of  NRIF3  as  a  coactivator  of 
the  TRs,  which  are  also  widely  expressed  (70),  or  the  RXRs, 
which  are  ubiquitously  expressed  (48). 

A  key  goal  concerning  the  action  of  nuclear  hormone  recep¬ 
tors  is  to  understand  the  molecular  events  underlying  the  func¬ 
tional  specificities  of  different  receptors  in  regulating  the  ex¬ 
pression  of  their  target  genes.  Determinants  of  specificity 
include  specific  ligand  binding  and  selective  binding  of  the 
receptors  to  their  cognate  response  elements,  as  well  as  specific 
expression  patterns  of  different  receptors.  These  determinants 
alone,  however,  are  not  always  sufficient  to  explain  the  extents 
of  specificity  observed  for  members  of  the  nuclear  receptor 
family.  For  example,  several  members  of  the  thyroid  hormone/ 
retinoid  receptor  subfamily  may  bind  similarly  to  common 
DNA  elements  while  target  genes  containing  those  elements 
are  only  selectively  activated  by  certain  receptors  (20,  47). 
Therefore,  it  is  likely  that  additional  factors  (determined  by 
ceil  and  promoter  contexts)  are  involved  in  determining  recep¬ 
tor  functional  specificity.  In  this  respect,  most  known  coacti¬ 
vators  do  not  appear  to  be  receptor  specific.  For  example, 
members  of  the  SRC-1  and  CBP/p300  families  interact  with 
and  appear  to  be  involved  in  the  actions  of  many  nuclear 
receptors  (13, 14, 34, 37).  Two  known  coactivators  that  may  be 
involved  in  receptor-specific  functions  are  ARA70  and  PGC-1. 
The  AR  coactivator  ARA70  has  been  reported  to  potentiate 
the  activity  of  AR  more  efficiently  than  it  does  the  activities  of 
other  nuclear  receptors  (85).  However,  whether  ARA70  can 
associate  with  other  receptors  remains  to  be  thoroughly  exam¬ 
ined.  The  expression  of  PGC-1  is  restricted  mainly  to  the 
brown  fat  tissue  and  is  thought  to  be  directly  involved  in  the 
regulation  of  thermogenesis  by  PPAR7  (60).  Nevertheless, 
PGC-1  exhibits  a  relatively  broad  spectrum  of  binding  to  dif¬ 
ferent  nuclear  receptors.  Therefore,  the  identification  of 
NRIF3  represents  the  first  example  of  a  coactivator  with  such 
a  clearly  defined  receptor  specificity. 

The  receptor  specificity  of  NRIF3  raises  an  interesting  ques¬ 
tion  about  its  molecular  mechanism.  Domain  analysis  suggests 
that  the  LXXLL  motif  (amino  acids  9  to  13)  and  its  flanking 
sequences  in  NRIF3  are  not  sufficient  for  interaction  with 
liganded  nuclear  receptors.  In  fact,  such  interaction  is  com¬ 
pletely  abolished  in  EnL,  an  alternatively  spliced  product 
which  has  the  same  LXXLL  motif  and  contains  the  first  161 
amino  acids  (of  a  total  of  177  amino  acids)  of  NRIF3.  This 
result  suggests  that  a  putative  domain  consisting  of  the  last  16 
amino  acids  of  NRIF3  (residues  162  to  177)  is  essential  for  its 
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interaction  with  liganded  receptors.  Inspection  of  this  NCD 
indicates  that  it  contains  an  LXXIL  motif  (amino  acids  172  to 
176),  and  secondary-structure  analysis  predicts  the  formation 
of  an  a-helix.  The  predicted  helix  structure  and  the  similarity 
of  LXXIL  to  the  canonical  LXXLL  raise  the  possibility  that 
this  LXXIL-containing  region  plays  a  direct  role  in  NRIF3- 
receptor  interactions. 

Our  modeling  of  the  NCD-LBD  interaction  (Fig.  10)  sug¬ 
gests  that  the  same  hydrophobic  groove  in  the  ligand-bound 
LBD,  which  has  been  shown  by  previous  studies  to  be  the 
binding  site  for  coactivators  such  as  SRC-l/NCoA-1  and 
GRIPl  (19, 23, 56),  may  also  be  a  suitable  site  for  the  docking 
of  the  C-terminal  helix  of  NRIF3.  Thus,  the  utilization  of  the 
complementary  pair  of  an  a-helix  (in  the  coactivator)  and  a 
hydrophobic  groove  (in  the  receptor)  for  interaction  seems  to 
be  a  general  scheme  that  also  applies  to  NRIF3.  The  binding 
energy  estimated  for  the  NCD  and  the  TR  LBD  (-21  kcal/ 
mol)  is  similar  to  the  value  calculated  for  the  second  LXXLL 
box  of  SRC-l/NCoA-1  and  the  TR  LBD  (-18  kcal/mol).  To 
explore  the  mechanisms  suggested  by  the  modeling,  we  found 
that  the  NCD  can  directly  mediate  interaction  with  the  LBDs 
in  a  ligand-dependent  manner  (Fig.  11).  Moreover,  the  LXXIL 
motif  contained  in  the  NCD  was  found  to  be  essential  for  such 
interactions  (Fig.  11).  In  summary,  the  results  of  a  combination 
of  a  computer  modeling  and  domain  and  mutagenesis  analyses 
clearly  suggest  that  the  NCD  is  an  important  surface  that  is 
directly  involved  in  interaction  with  the  LBDs  of  the  receptors, 
where  the  LXXIL  motif  of  the  NCD  mimics  the  function  of  a 
canonical  LXXLL.  The  AF-2  helix  (which  is  a  critical  constit¬ 
uent  of  the  hydrophobic  groove  formed  upon  ligand  binding) 
of  the  LBD  has  been  shown  to  be  important  for  interaction 
with  the  LXXLL  boxes  of  the  coactivators  (23).  Interestingly, 
we  have  examined  two  TR  AF-2  mutants  (66)  and  found  that 
in  both  cases,  ligand-dependent  interaction  with  NRIF3  was 
abolished  (43a). 

However,  the  NCD  alone  does  not  appear  to  harbor  the 
same  specificity  as  NRIF3  (Fig.  11).  Thus,  it  seems  likely  that 
another  part  of  the  NRIF3  molecule  contributes  to  the  ob¬ 
served  specificity  and/or  that  the  specificity  is  determined  by 
the  overall  three-dimensional  structure  of  NRIF3.  In  this  re¬ 
gard,  the  potential  role  of  the  N-terminal  LXXLL  motif  is 
intriguing.  Although  the  N-terminal  LXXLL  motif  (amino  ac¬ 
ids  9  to  13)  is  insufficient  alone  to  mediate  an  interaction  with 
TR  or  RXR  (Fig.  8),  it  can  influence  the  interaction  of  NRIF3 
with  these  receptors,  as  the  L9A  NRIF3  mutant  retains  a 
significant  but  nevertheless  reduced  level  of  association  with 
liganded  TR  or  RXR  (Fig.  9).  Thus,  NRIF3  appears  to  employ 
at  least  two  regions  in  interacting  with  liganded  TR  or  RXR, 
with  the  NCD  playing  an  essential  role  and  the  N-terminal 
LXXLL  motif  playing  a  secondary  role.  A  simplified  explana¬ 
tion  for  these  findings  is  that  the  NCD  provides  a  major  surface 
for  receptor  binding  while  the  N-terminal  LXXLL  motif  makes 
some  minor  contact  with  either  the  same  receptor  molecule  or, 
more  likely,  with  the  other  partner  of  a  homodimer  or  het¬ 
erodimer  to  further  stabilize  the  NRIF3-receptor  interaction. 
An  example  of  a  coactivator  molecule  employing  two  separate 
regions  to  interact  with  the  two  partners  of  a  receptor  dimer 
has  been  documented  for  the  recently  solved  crystal  structure 
of  liganded  PPAR-y  complexed  with  SRC-l/NCoA-1  (56).  If 
NRIF3  indeed  employs  both  its  NCD  and  its  N-terminal 
LXXLL  motif  in  receptor  interactions,  the  specificity  may  re¬ 
sult  from  the  intramolecular  dialog  between  the  two  regions  as 
well  as  the  intermolecular  dialog  among  NRIF3  and  the  re¬ 
ceptors.  However,  it  remains  possible  that  the  N-terminal 
LXXLL  plays  only  a  more  indirect  role  and  that  the  overall 


three-dimensional  structure  of  NRIF3  is  responsible  for  its 
observed  specificity. 

Accumulating  evidence  suggests  that  the  actions  of  tran¬ 
scriptional  activating  proteins  are  (usually)  mediated  by  mul¬ 
tiprotein  complexes  (59),  and  such  a  scheme  is  also  likely  for 
nuclear  receptors.  For  example,  biochemical  evidence  suggests 
that  multiprotein  complexes  associate  with  liganded  TR  and 
VDR  (24, 62, 86).  Interestingly,  many  of  the  proteins  identified 
in  these  studies  are  not  known  coactivators.  While  the  study  of 
known  coactivators  such  as  CBP/p300  and  members  of  the 
SRC-1  family  has  suggested  that  histone  acetylation  may  play 
an  important  role  in  receptor-mediated  transactivation  (8, 14, 
57,  69),  detailed  elucidation  of  the  transactivation  mecha- 
nism(s)  used  by  these  receptors  awaits  the  identification  and 
study  of  additional  cofactors  involved  in  the  transactivation 
process. 

Our  results  with  NRIF3  suggest  that  transcriptional  activa¬ 
tion  by  nuclear  receptors  may  employ  a  receptor-specific  co- 
activator(s)  in  addition  to  the  generally  used  coactivators  such 
as  CBP  and  SRC-1.  Therefore,  coactivators  with  NRIF3-like 
properties  may  contribute  to  the  functional  specificities  of  nu¬ 
clear  receptors  in  vivo.  Based  on  our  results  with  NRIF3  and 
the  results  of  previous  studies  of  nuclear  receptor  action,  we 
suggest  a  combinatorial  specificity  model  where  a  coactivation 
complex  is  likely  composed  of  two  kinds  of  factors;  general 
factors  that  interact  with  and  are  involved  in  the  action  of  many 
nuclear  receptors  (such  as  CBP  and  SRC-1)  and  specific  fac¬ 
tors  that  exhibit  receptor  specificity  (such  as  NRIF3).  In  addi¬ 
tion  to  interacting  with  the  liganded  receptor,  coactivators  may 
also  communicate  with  each  other  within  the  coactivation  com¬ 
plex  through  protein-protein  interactions  (e.g.,  CBP/p300  can 
interact  with  SRC-l/NCoA-1  or  p/CIP)  (37, 74, 84).  An  intrigu¬ 
ing  possibility  is  that  the  combinatorial  actions  of  specific  fac¬ 
tors  and  other  partners  involved  in  the  transactivation  process 
facilitate  the  recruitment  of  specific  coactivation  complexes  for 
different  receptors  (under  different  cell,  promoter,  and  HRE 
contexts),  which  would  provide  an  important  mechanistic  layer 
for  receptor  functional  specificity.  An  advantage  of  employing 
such  a  combinatorial  strategy  is  that  a  broad  array  of  diversity 
can  be  generated  from  a  relatively  small  number  of  involved 
factors.  Further  study  of  NRIF3  with  known  and  possibly  other 
yet  to  be  identified  coactivators,  as  well  as  analysis  of  the 
interplay  among  these  coactivators,  should  provide  important 
insights  into  the  molecular  mechanism(s)  underlying  the  spec¬ 
ificity  of  receptor-mediated  regulation  of  target  gene  expres¬ 
sion. 
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Nuclear  hormone  receptors  (NRs)  are  potential  targets  for  thera¬ 
peutic  approaches  to  many  clinical  conditions,  including  cancer, 
diabetes,  and  neurological  diseases.  The  crystal  structure  of  the 
ligand  binding  domain  of  agonist-bound  NRs  enables  the  design  of 
compounds  with  agonist  activity.  However,  with  the  exception  of 
the  human  estrogen  receptor-a,  the  lack  of  antagonist-bound 
"inactive"  receptor  structures  hinders  the  rational  design  of  re¬ 
ceptor  antagonists.  In  this  study,  we  present  a  strategy  for  design¬ 
ing  such  antagonists.  We  constructed  a  model  of  the  inactive 
conformation  of  human  retinoic  acid  receptor-a  by  using  informa¬ 
tion  derived  from  antagonist-bound  estrogen  receptor-a  and  ap¬ 
plied  a  computer-based  virtual  screening  algorithm  to  identify 
retinoic  acid  receptor  antagonists.  Thus,  the  currentiy  available 
crystal  structures  of  NRs  may  be  used  for  the  rational  design  of 
antagonists,  which  could  lead  to  the  development  of  novel  drugs 
for  a  variety  of  diseases. 

Members  of  the  nuclear  hormone  receptor  (NR)  family  are 
under  the  control  of  a  wide  variety  of  hormones  and 
ligands,  such  as  steroids,  retinoids,  thyroid  hormone,  1,25- 
dihydroxy-vitamin  D3,  and  prostanoids.  Many  of  these  NRs  are 
potential  targets  for  the  therapy  of  a  variety  of  diseases:  antag¬ 
onists  of  estrogen  receptor-a  (ERa)  (e.g.,  tamoxifen)  are  clin¬ 
ically  used  for  the  treatment  of  breast  cancer  (1)  whereas  retinoic 
acid  receptor  (RAR)  agonists  and  antagonists  block  the  growth 
of  a  number  of  neoplastic  cells  including  breast  tumor  cells  (2, 
3).  Agonists  for  retinoid  X  receptors  (RXRs)  and  peroxisome 
proliferator-activated  receptor  y  (PPARy)  are  potential  candi¬ 
dates  for  use  in  the  treatment  of  cancer  and  diabetes  (PPARy  is 
the  receptor  for  the  antidiabetic  drug  thiazolidinedione)  (4-7), 
whereas  Nurrl  ligands  may  be  useful  for  treatment  of  Parkin¬ 
son’s  disease  (8).  Thus,  designing  molecules  that  selectively 
activate  or  inhibit  specific  NRs  is  of  considerable  biological 
significance  and  will  likely  have  the  potential  for  use  in  important 
clinical  applications. 

The  crystal  structures  of  the  ligand  binding  domain  (LED)  of 
many  members  of  the  NR  family  recently  have  been  solved,  and 
the  ligand-dependent  structural  changes  involved  in  transcrip¬ 
tional  activation  have  been  clarified,  enabling  the  structure- 
based  design  of  specific  agonists  (9, 10).  Recent  studies  on  ERa 
also  have  shed  light  on  the  LSD  structural  changes  mediated  by 
NR  antagonists  (11,  12):  ERa  agonists  and  antagonists  super¬ 
impose  well  and  engage  in  a  very  similar  network  of  hydrophobic 
and  electrostatic  contacts  with  the  receptor.  However,  in  the 
agonist-bound  conformation,  the  C-terminal  helix  H12  sits  like 
a  lid  on  top  of  the  ligand  (11)  (a  similar  observation  was  made 
for  virtually  all  of  the  NR  LED  structures  solved  so  far;  ref.  9). 
In  contrast,  the  two  ERa  antagonists  present  a  protruding  arm 
that  is  not  compatible  with  the  “closed  lid”  conformation  (11, 
12)  (Fig.  L4).  As  a  result,  helix  H12  is  pushed  away  from  the 
ligand  binding  site  and  relocates  in  the  coactivator-binding 
pocket  of  the  receptor  (Fig.  15)  (11).  Moreover,  the  LxxML 
motif  (where  L  is  a  leucine,  M  a  methionine,  and  x  any  residue) 
of  the  ERa  helix  H12  mimics,  and  probably  competes  with,  a 
LxxLL  helical  peptide  found  in  a  wide  variety  of  coactivator 
proteins.  The  alignment  of  the  LED  of  various  NRs  (13)  suggests 


that  a  common  structural  mechanism  would  be  for  the  antago¬ 
nists  to  induce  the  relocation  of  helix  H12  into  the  hydrophobic 
coactivator-binding  groove  of  the  receptor.  The  observation  that 
the  progesterone  receptor  antagonist  RU486  superimposes  with 
the  natural  hormone  progesterone,  but  presents  a  protruding 
arm  similar  to  that  of  tamoxifen  (14, 15)  provides  support  for  the 
universality  of  this  mechanism  of  antagonistic  activity. 

Our  goal  in  this  study  is  to  provide  further  evidence  for  this 
hypothesis  by  building  a  model  of  the  antagonist-bound  confor¬ 
mation  of  RARa,  a  NR  that  plays  an  important  role  in  the 
differentiation  and  proliferation  of  a  wide  variety  of  cell  types 
and  for  which  only  the  agonist  bound  conformation  is  known 
(16-18),  and  to  rationally  and  rapidly  identify  new  antagonists 
for  this  receptor.  We  built  a  model  of  the  antagonist-bound 
structure  of  RAR,  based  on  the  ERa/tamoxifen  complex  (12). 
The  model  was  used  for  the  virtual  screening  of  a  database  of 
>“150,000  available  compounds,  and  antagonist  candidates  were 
tested  in  vitro.  Two  novel  antagonists  and  a  novel  agonist  were 
discovered.  The  ligands  were  specific  for  RAR,  confirming  the 
validity  of  our  model  and  the  potential  therapeutic  application 
of  our  strategy. 

Materials  and  Methods 

Building  of  the  Model  of  Antagonist-Bound  RAR.  A  helical  peptide 
PLIREMLENP  corresponding  to  helix  H12  of  RARy  was 
docked  into  the  putative  coactivator  binding  pocket  of  another 
RARy  molecule.  We  hypothesized  that  the  IxxML  motif  con¬ 
tacts  the  coactivator  binding  site  of  the  receptor,  and  an  auto¬ 
matic  docking  procedure  was  carried  out  toward  this  site,  with 
flexible  protein  and  peptide  side  chains,  according  to  a  biased 
probability  Monte  Carlo  energy  minimization  procedure  (19, 
20).  Two  critical  features  of  the  interaction  between  the  LEDs  of 
NRs  and  their  coactivators  were  used  to  carry  out  the  docking: 
(0  The  “charge  clamp,”  initially  observed  in  the  complex 
between  SRC-1  and  peroxisome  proliferator-activated  receptor 
y  (21),  where  a  conserved  glutamate  (E414  in  RARy)  and  lysine 
(IQ46  in  RARy)  at  opposite  ends  of  the  hydrophobic  cavity  of 
the  receptor  contact  the  backbone  of  the  coactivator’s  LxxLL 
box,  enabled  the  orientation  of  the  helical  peptide.  («)  The 
finding  that  the  leucines  of  the  LxxLL  motif  of  SRC-1  are  buried 
in  the  hydrophobic  cavity  of  the  receptor  determines  which  side 
of  the  helix  faces  the  receptor.  Here,  the  isoleucine,  methionine, 
and  leucine  of  the  IxxML  motif  were  buried  in  the  binding  site 
of  RARy.  Loose  distance  restraints  were  set  between  the  charge 
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Fig.  1.  Modeling  of  the  antagonist-bound  structure  of  RAR.  Agonist  (white) 
and  antagonist  (cyan)  superimpose  in  the  binding  pocket  of  ERa,  but  the 
antagonist  presents  an  additional  protruding  arm  that  pushes  helix  12  (H12, 
green)  away  (A).  As  a  result,  H12  relocates  in  the  coactivator  binding  pocket 
of  the  receptor  (HI  2,  red)  (fl).  Based  on  the  ERa  structure,  helix  H12  of  RARy 
(red)  was  docked  to  the  coactivator  binding  pocket  of  the  RARy-LBD  (critical 
hydrophobic  residues  are  displayed  in  magenta)  (O.  and  the  C  terminus  of  the 
protein  was  remodeled  from  its  agonist-bound  conformation  (green)  to  its 
antagonist-bound  conformation  (red)  (D). 

clamp  of  the  receptor  (21)  (i.e.,  E414  and  K246)  and  backbone 
nitrogen  and  oxygens  of  the  peptide  (nitrogen  of  the  isoleucine 
on  one  end,  and  carbonyl  of  the  methionine,  leucine,  and 
asparagine  in  the  MLEN  motifs,  respectively).  The  energy  of  the 
complex  was  minimized  in  the  internal  coordinate  space  by  using 
the  modified  ECEPP/3  potentials.  The  subset  of  the  variables 
minimized  with  the  ICM  method  (19, 20, 22, 23)  included  the  side 
chains  of  the  receptor,  six  positional  variables  of  the  helix,  and 
the  side-chain  torsion  angles  of  the  helix. 

After  the  ICM  docking  procedure,  we  built  a  model  of  antag¬ 
onist-bound  RARy.  The  structure  of  the  receptor  was  kept  rigid 
but  for  the  side  chains  and  backbone  of  the  25  C-terminal 
residues  (corresponding  to  the  last  10  residues  of  helix  Hll,  the 
loop  from  Hll  to  H12,  and  H12),  and  for  the  side  chains  of  the 
putative  coactivator  binding  site  (within  6  A  of  the  previously 
docked  helical  peptide).  Tethers  then  were  set  between  the  C 
terminus  of  the  receptor  and  the  corresponding  residues  of  the 
docked  helical  peptide,  and  the  energy  of  the  receptor  was 
minimized  by  a  stochastic  global  energy  optimization  in  the 
internal  coordinate  space  (22,  23). 

The  last  step  was,  from  the  resulting  model  of  antagonist- 
bound  RARy,  to  derive  the  structure  of  the  antagonist-binding 
pocket  of  RARa:  the  three  nonidentical  residues  in  the  vicinity 
of  the  binding  pocket  (A234,  M272,  and  A397)  were  changed  to 
the  RARa  isoform  (S234,  1272,  and  V397,  respectively)  and 
energy-minimized.  Another  possibility  would  have  been  to  in¬ 
troduce  the  mutations  before  remodeling  the  C  terminus  of  the 
receptor.  We  preferred  to  proceed  as  described  here  to  preserve 
the  integrity  of  the  receptor  during  the  critical  remodeling  of  the 
C-terminal  end. 

Receptor-Ligand  Docking.  An  initial  docking  was  carried  out  with 
a  grid  potential  representation  of  the  receptor  and  flexible 
ligand  (24).  The  resulting  conformation  then  was  optimized  with 


a  full  atom  representation  of  the  receptor,  flexible  receptor  side 
chains,  and  flexible  ligand,  by  an  ICM  stochastic  global  optimi¬ 
zation  algorithm  as  implemented  in  the  MolSoft  ICM  2.7  program 
(23,  24). 

Screening  of  a  Virtual  Library  of  Compounds.  The  flexible-ligand/ 
grid-potential-receptor  docking  algorithm  (23,  24)  was  carried 
out  automatically  on  the  Available  Chemicals  Directory  library 
of  153,000  available  chemical  compounds  (MDL  Information 
Systems,  San  Leandro,  CA).  The  screening  took  less  than  a 
month  on  10  194-MHz  IP25  processors.  Each  compound  was 
assigned  a  score,  according  to  its  fit  with  the  receptor,  which  took 
into  account  continuum  as  well  as  discreet  electrostatics,  hydro- 
phobicity,  and  entropy  parameters  (25).  The  distribution  of  the 
compounds  according  to  their  score  is  presented  at  http:// 
abagyan.scripps.edu/PNAS/MS2000/.  All  compounds  scoring 
better  (i.e.,  lower)  than  -32  were  screened  further  for  the 
number  of  hydrogen  bonds  engaged  with  the  receptor.  The  134 
compounds  that  made  at  least  two  hydrogen  bonds  with  the 
receptor  were  preselected.  The  609  compounds  scoring  better 
than  -37  also  were  preselected,  regardless  of  the  hydrogen 
bonding  network.  This  preselection  pool  then  was  further  min¬ 
imized  with  a  full  atom  representation  of  the  receptor,  as 
described  above.  The  quality  of  the  fit  of  the  500  best-scoring 
compounds  then  was  visually  estimated,  and  32  compounds  were 
selected  for  biological  testing.  These  compounds  are  not  neces¬ 
sarily  the  ones  with  the  best  final  scores,  but  the  ones  we  thought, 
after  careful  visual  inspection,  presented  the  best  characteristics, 
such  as  Van  der  Waals  fit  or  hydrogen  bonding  (see  http:// 
abagyan.scripps.edu/PNAS/MS2000/). 

It  occurred  to  us  that  during  the  selection  by  the  MolSoft 
virtual  screening  procedure,  it  was  preferable  to  set  up  an  initial 
cut-off  value  poorly  selective  (i.e.,  -32)  to  recover  a  large  pool 
of  preselected  compounds  and  to  apply  to  this  pool  subsequent 
screens  specific  for  the  system,  such  as  number  of  hydrogen 
bonds  (used  here)  or  presence  of  a  hydrogen  bond  acceptor  (for 
example)  at  a  specific  point  of  space.  As  a  result,  we  derived  the 
value  -32  as  a  good  initial  threshold  (this  value  generates  an 
initial  pool  of  3,000-4,000  compounds). 

Biological  Activity  of  the  Antagonist  and  Agonist  Candidates.  HeLa 
cells  were  transfected  by  calcium  phosphate  precipitation  using 
I  jag  of  the  Gal4-responsive  chloramphenicol  acetyltransferase 
(CAT)  reporter  pMCl  10  and  1  /ag  of  Gal4-hR  ARo-LBD  or  1  jag 
of  Gal4-hRXRjS-LBD.  Studies  also  were  performed  with  the 
three  wild-type  hRAR  isoforms  (hRARa,  hRAR/3,  and 
hRARy)  by  using  a  AMTV-IR-CAT  reporter  as  described  (26, 
27).  Cell  cultures  were  supplemented  with  indicated  ligands 
immediately  after  addition  of  the  calcium  phosphate/DNA 
precipitate.  Media  and  ligands  were  replaced  after  24  h,  and  cells 
were  harvested  and  essayed  for  CAT  activity  24  h  later. 

Results 

Modeling  of  the  RAR  Antagonist  Binding  Pocket.  The  x-ray  structure 
of  RARy  bound  to  the  agonist  all-trans  RA  is  available  (18); 
however,  the  conformation  of  the  receptor  bound  to  an  antag¬ 
onist  is  not  known.  We  used  the  observations  made  from  the 
structure  of  ERa  bound  to  an  agonist,  17/3-estradiol  (11),  and 
two  antagonists,  tamoxifen  and  raloxifene  (11,  12),  to  build  a 
model  of  antagonist-bound  RAR  (Fig.  1 A  and  B).  We  docked 
helix  H12  of  RARy  into  the  putative  coactivator  binding  pocket 
of  the  receptor  as  described  (27)  (see  Materials  and  Methods  for 
details)  (Fig.  1C)  and  remodeled  the  25  C-terminal  residues, 
starting  near  the  end  of  helix  11,  through  an  extensive  global 
energy  minimization  procedure  (Fig.  ID). 

Docking  of  Known  RAR  Antagonists  into  the  Modeled  Receptor.  A  few 

RAR  antagonists  have  been  described  in  the  literature;  and 
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Fig.  2.  RAR  antagonists.  Two  known  antagonists  (A  and  8)  and  two  novel 
antagonists  (C and  0).  {Left)  Chemical  structure.  {Right)  Conformation  docked 
into  the  receptor  (part  of  the  receptor  is  displayed  as  a  ribbon  representation, 
and  the  binding  pocket  boundary  is  displayed  in  yellow).  Cyan,  carbons;  red, 
oxygen;  blue,  nitrogen;  magenta,  fluorine;  yellow,  sulfur.  Hydrogens  are  not 
represented  for  clarity. 

several  of  them  are  serious  candidates  for  cancer  therapy  (2, 28). 
A  well-characterized  ligand  is  AGN193109,  which  inhibits  the 
three  RAR  isoforms  at  nanomolar  concentrations  (29).  Another 
very  potent  antagonist  is  MX781,  which  is  effective  against 
ERa-positive  and  -negative  breast  cancer  cells,  with  no  apparent 
toxicity  (2).  The  activity  of  these  two  ligands  has  been  presented 
in  detail,  but  no  structural  information  has  been  reported  on 
their  mode  of  interaction  with  the  receptor.  We  built  a  model  of 
RARy  complexed  either  with  AGN193109  or  MX781,  by  using 
our  flexible  docking  algorithm  (24)  (Fig.  2  A  and  B).  In  both 
cases,  the  antagonist  superimposed  with  the  agonist  all-trans 
RA.  As  observed  for  ERa,  the  antagonists  also  presented  a 
protruding  arm,  which  was  absent  in  RAR  agonists.  Very 
importantly,  this  protruding  arm  coincided  exactly  with  the 
single  opening  in  the  ligand  binding  pocket  of  our  modeled 
receptor,  generated  by  the  displacement  of  helbt  H12  (Fig.  2  A 
and  B),  and  made  stabilizing  hydrophobic  contacts  with  the 
protein.  It  is  very  unlikely  that  this  perfect  fit,  observed  for  both 
antagonists,  was  fortuitous.  On  the  contrary,  this  feature  mimics 
the  inactivation  mechanism  revealed  by  the  crystal  structure  of 
ERa  bound  to  tamoxifen  and  raloxifene.  Therefore,  our  docking 
results  of  AGN193109  and  MX781  very  strongly  suggest  that:  (/) 
the  structural  mechanisms  of  antagonist  activity  for  ERa  are 
shared  by  other  NRs,  and  («)  our  model  of  the  RAR  antagonist 
binding  pocket  could  be  used  to  design  novel  antagonists. 

Screening  of  a  Virtual  Library  and  Discovery  of  Novel  RAR  Antagonists. 

High  throughput  functional  screening  currently  is  the  most  used 
method  for  the  discovery  of  receptor-specific  ligands.  Although 
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Fig.  3.  Functional  assays  of  the  novel  antagonists.  HeLa  cells  were  trans¬ 
fected  with  a  Gal4-hRARa-LBD  expression  vector  and  a  Gal4-CAT  reporter 
gene  (results  were  similar  in  studies  using  the  three  hRAR  isoforms).  The  cells 
were  incubated  with  5  nM  all-trans  RAto  stimulate  CAT  activity,  and  the  effect 
of  each  antagonist  on  inhibiting  CAT  was  examined  at  2  and  20  /.iM  concen¬ 
tration  (the  known  antagonist  RO-41-5253  was  used  as  a  positive  control). 

efficient,  it  requires  the  physical  availability  and  management  of 
hundreds  of  thousands  of  chemical  compounds.  In  the  present 
work,  we  used  a  virtual  library  composed  of  the  predicted 
structure  of  more  than  150,000  available  compounds  (see  Ma¬ 
terials  and  Methods).  Each  compound  was  automatically  docked 
in  a  grid  representation  of  the  modeled  RARa  antagonist 
binding  pocket.  Five  grid  potentials  carried  information  on  the 
shape,  hydrophobicity,  electrostatics,  and  hydrogen-bonding 
availability  of  the  receptor,  and  enabled  a  rapid  docking  simu¬ 
lation  (24,  25).  RARa  was  selected  over  the  other  two  isoforms 
(RARp  and  RARy)  because  recent  data  suggests  it  could  be  a 
medically  more  relevant  target  (28).  After  an  automatic  selection 
procedure  with  flexible  ligands,  and  optimization  of  the  selected 
candidates  with  flexible  protein  side  chains  (see  Materials  and 
Methods  for  details),  32  compounds  were  considered  as  potential 
antagonists  of  RARa  and  ordered. 

To  test  these  compounds  in  vitro,  HeLa  cells  were  transfected 
with  a  Gal4-hRARa-LBD  expression  vector  and  a  Gal4-CAT 
reporter  gene  (26).  Studies  also  were  performed  with  the  three 
wild-type  hRAR  isoforms  and  a  AMTV-IR-CAT  reporter  (26, 
27).  These  gave  similar  results  as  those  found  with  Gal4- 
hRARa-LBD  (data  not  shown).  The  cells  were  incubated  with 
all-trans  RA  to  stimulate  CAT  activity,  and  the  effect  of  each 
antagonist  candidate  on  inhibiting  CAT  stimulation  by  all-trans 
RA  was  examined.  Possible  toxicity  of  the  compounds  was 
deduced  from  the  amount  of  cellular  protein  extract  after  2  days 
of  incubation.  Two  antagonist  candidates  inhibited  CAT  activity 
by  55%  and  33%  at  20  ju,M  with  no  apparent  toxicity  (Fig.  3).  The 
Gal4-hRARa  activity  illustrated  in  Fig.  3  was  equivalent  for  the 
other  two  RAR  isoforms  (data  not  shown).  No  inhibition  was 
observed  when  CAT  expression  was  under  the  control  of  a 
Gal4-mRXRp-LBD  fusion  construct,  indicating  that:  (i)  the 
antagonists  are  specific  for  RAR,  and  («)  the  inhibition  is  caused 
by  an  interaction  with  the  Gal4-RAR-LBD  fusion  protein  and 
does  not  result  from  some  nonspecific  effect  on  CAT  activity 
(data  not  shown). 

The  two  RAR  antagonists  dock  into  the  ligand  binding  pocket 
of  the  receptor  (Figs.  2  C  and  D  and  4).  As  observed  for 
AGN193109  and  MX781,  they  fit  in  the  same  binding  pocket  as 
the  natural  agonist  all-trans  RA,  but  present  an  additional  arm, 
which  protrudes  out  of  the  pocket.  Antagonist  1  has  a  tri-fluoro 
group  where  the  retinoid  receptor  ligands  usually  carry  a  car- 
boxylate  group  (in  antagonist  2,  the  corresponding  domain  is 
truncated).  In  our  model,  antagonist  2  engages  in  a  hydrogen 
bond  with  Ser-234  of  the  hRARa  (Fig.  4B).  However,  the  S234A 
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Fig.  4.  Novel  RAR  antagonists.  (A  and  S)  Stereo  representation  of  antago¬ 
nists  1  and  2  docked  into  the  binding  site  of  the  receptor.  The  ligands  make 
extensive  hydrophobic  Interactions  with  residues  from  helix  3,  helix  5,  and 
helix  11.  Antagonist  2  (S)  is  engaged  in  an  additional  hydrogen  bond  with 
Ser-234  of  helix  3  and  contacts  the  remodeled  C  terminus  (red)  at  Pro-405.  (C 
and  D)  The  fit  of  antagonists  1  and  2  into  the  receptor  binding  pocket  isshown. 


mutation  in  the  other  two  isoforms  does  not  alter  the  ligand 
antagonist  activity,  suggesting  that  this  hydrogen  bond  is  not 
essential  for  the  interaction.  An  obvious  way  to  increase  the 
affinity  of  these  antagonists  would  be  to  substitute  the  tri-f  luoro 
group  by  a  carboxylate  in  antagonist  1  or  elongate  and  add  a 
carboxylate  to  antagonist  2,  which  would  result  in  more  stabi¬ 
lizing  interactions  with  two  conserved  arginines  of  the  receptor. 
However,  the  purpose  of  this  work  is  to  provide  evidence  that  the 
rational  design  of  antagonists  from  the  model  of  the  inactive 
receptor  is  feasible  and  not  to  optimize  the  affinity  of  the 
compounds.  The  in  vitro  functional  assays  provide  evidence  that 
our  modeling  scheme  is  relevant  and  can  be  used  to  design  novel 
antagonists  of  NRs. 

We  applied  the  same  strategy  to  discover  agonists,  by  using  the 
crystal  structure  of  the  active  conformation  of  RARy  (18),  and 
could  discover  three  novel  agonists  10-25%  active  at  200  nM  and 
fully  active  at  20  jaM,  of  30  compounds  tested  (data  not  shown). 

Screening  of  a  Database  of  Known  Ligands.  To  assess  the  quality  of 
our  setup  of  the  ICM  screening  algorithm  (23),  we  built  a  small 


virtual  database  made  up  of  antagonists  and  agonists  for  differ¬ 
ent  members  of  the  NR  family  (Table  1).  We  screened  this 
database  with  our  model  of  antagonist-bound  RAR,  as  we  did 
for  the  Available  Chemicals  Directory  database.  The  screening 
was  repeated  four  times,  to  test  the  reproducibility  of  our 
method.  Table  1  shows  that  for  each  ligand  the  score  varies  a  lot 
from  one  screening  to  the  other.  This  finding  reflects  the 
generation  of  different  ligand  conformations  from  one  docking 
simulation  to  another  (data  not  shown)  and  represents  the 
limitation  of  our  method,  as  discussed  below. 

Table  1  lists  as  “selected”  the  ligands  that  met  with  the  criteria 
for  preselection  and  final  inspection  during  the  Available  Chem¬ 
icals  Directory  screening  (i.e.,  score  better  than  -37  or  score 
better  than  -32  and  at  least  two  hydrogen  bonds  with  the 
receptor;  see  Materials  and  Methods  for  details).  Seven  of  the 
nine  known  RAR  ligands  (i.e.,  «>80%)  and  one  of  the  six 
non-RAR  ligands  (i.e.,  '»16%)  were  selected.  The  fact  that  RAR 
agonists,  as  well  as  antagonists,  produced  good  scores  was 
expected,  because  the  binding  pocket  used  for  the  screening  is 
equivalent  to  the  agonist  binding  pocket,  with  an  additional 
opening  generated  by  the  remodeling  of  the  C  terminus  of  the 
receptor.  The  two  false  negatives,  AGN193836  and  Ro415253, 
were  missed  because  of  steric  clashes,  as  discussed  below. 
Antagonist  1  was  not  found  either,  reflecting  its  rather  low 
affinity  for  the  receptor.  It  is  important  to  underline  here  that 
we  do  not  expect  to  detect  all  of  the  true  binders.  The  algorithm 
was  rather  designed  to  minimize  the  number  of  false  positives, 
which  correlates  with  the  number  of  unnecessary  in  vitro 
experiments  (25). 

In  that  respect,  the  presence  of  one  false  positive  of  six 
nonbinders  could  be  alarming,  because  such  a  ratio  would 
represent  about  25,000  false  positives  of  a  database  of  150,000 
compounds.  However,  the  binding  pockets  of  the  NRs  repre¬ 
sented  in  this  database  are  close  in  size  and  shape;  as  a  result,  the 
database  used  for  this  benchmark  was  composed  of  molecules 
presenting  strong  similarities  with  RAR  ligands.  Therefore,  we 
believe  this  ratio  is  not  representative.  The  fact  that  we  needed 
to  test  only  32  molecules  to  discover  three  novel  RAR  ligands 
confirms  this  assumption. 

Next,  we  tried  to  address  why  some  ligands,  such  as  Ro415253, 
were  repeatedly  missed  by  our  screening  algorithm  (Ro415253 
was  still  not  selected  after  10  docking  simulations,  data  not 
shown).  We  hypothesized  that  the  ligand  could  not  fit  into  the 
potential  maps  generated  from  our  model  and  carried  out  a 
docking  simulation  with  a  full  atom  representation  of  the 
receptor,  according  to  a  Monte  Carlo  energy  minimization  of  the 
complex,  with  both  flexible  ligand  and  flexible  receptor  side 
chains  (24).  This  docking  simulation  produced  a  solution  were 
the  ligand  fits  into  the  binding  pocket;  the  core  of  the  ligand 
(from  the  carboxylate  to  the  internal  sulfone)  superimposes  with 
agonists  such  as  all-trans  RA,  whereas  the  alkyl  arm  sticks  out 
of  the  pocket,  as  previously  described  for  the  other  antagonists 
(data  not  shown).  The  conformation  of  several  receptor  side 
chains  was  modified  during  the  docking  simulation,  to  accom¬ 
modate  the  size  of  the  ligand,  and  this  solution  would  not  have 
been  found  with  rigid  side  chains.  This  finding  suggests  that 
Ro415253  could  not  fit  into  the  potential  maps  generated  from 
the  original  receptor  conformation,  which  we  used  for  the 
screening.  We  generated  a  new  series  of  potential  maps  from  the 
optimized  receptor  structure  and  screened  the  small  database  of 
known  ligands  with  these  maps  four  times  as  above  (Table  1). 
The  score  assigned  to  Ro415253  was  twice  lower  (i.e.  better) 
than  the  threshold.  Surprisingly,  this  new  series  of  potential  maps 
totally  eliminated  the  presence  of  both  false  positive  and  false 
negative  (all  RAR  ligands  and  only  RAR  ligands  were  selected). 
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Table  1.  Control  screening  of  known  NR  ligands 


Ligand 

Activity 

Score  1 

Score  2 

Score  3 

Score  4 

Selected 

Binding 

References 

First  series 

AGN1 93836 

RAR.agonist 

-19.9 

-9.04 

-20.6 

-19.7 

_ 

+ 

(33) 

ATRA 

RAR  pan-agonist 

-46.4 

-41 

-41.7 

-41. 

+ 

+ 

(34) 

R0415253 

RAR_antagonist 

-25.5 

-22. 

-28.3 

-28.6 

- 

+  . 

(28) 

MX781 

RAR  antagonist 

-28. 

-23.9 

-27.1 

-36.4 

+ 

+ 

(2) 

CD2366 

RAR  pan-antagonist 

-28.5 

-23.3 

-30.9 

-32.3 

+ 

•f 

(34) 

Targretin 

RXR  pan-agonist 

-17.9 

-18.1 

-19.1 

-18.6 

- 

- 

(4) 

SR1 1203 

RXR  pan-agonist 

-27.5 

-27. 

-27. 

-27.2 

- 

- 

(34) 

Tamoxifen 

ER  modulator 

-29.3 

-27.5 

-29.8 

-28.3 

- 

- 

(23) 

Raloxifene 

ER  modulator 

-23.4 

-20.8 

-26.7 

-34.6 

+ 

- 

(22) 

RU486 

Progest  Rec  antag. 

-21.2 

-21.3 

-21.4 

-21.3 

- 

- 

(25) 

9cisRA 

RAR/RXR  agonist 

-32.5 

-32.6 

-32.9 

-16.9 

+ 

+ 

(34) 

AGN193109 

RAR  pan-antagonist 

-39.2 

-56. 

-57.4 

-39.4 

+ 

+ 

(29) 

AGNpartia 

RAR  partial  agonist 

-54.4 

-54.3 

-49.5 

-29.1 

+ 

+ 

(29) 

Am580 

RAR_agonist 

-34.2 

-34.4 

-34.8 

-34.5 

+ 

+ 

(34) 

EM652 

ER  antagonist 

-27. 

-27.4 

-21.7 

-28.8 

“ 

- 

(35) 

Antagonist  1 

Novel  RAR  antag. 

-28.5 

-28.1 

-28.7 

-28.8 

- 

+ 

(35) 

Antagonist  2 

Novel  RAR  antag. 

-27.6 

-38.9 

-40.2 

-26.3 

+ 

+ 

(35) 

Second  series 

AGN 193836 

RAR_agonist 

-37.2 

-36.5 

-36.7 

-35.3 

+ 

+ 

(33) 

ATRA 

RAR  pan-agonist 

-51.7 

-52.6 

-51.8 

-52.0 

+ 

+ 

(34) 

R0415253 

RAR_antagonist 

-28.9 

-24.4 

-39.0 

-46.6 

+ 

+ 

(28) 

MX781 

RAR  antagonist 

-45.3 

-48.0 

-40.2 

-45.6 

+ 

+ 

(2) 

CD2366 

RAR  pan-antagonist 

-50.7 

-50.8 

-29.3 

-29.3 

+ 

+ 

(34) 

Targretin 

RXR  pan-agonist 

-25.4 

-23.0 

-22.2 

-31.0 

- 

(4) 

SR1 1203 

RXR  pan-agonist 

-28.2 

-22.7 

-22.1 

-27.5 

- 

- 

(34) 

Tamoxifen 

ER  modulator 

-26.4 

-24.6 

-30.3 

-23.4 

- 

- 

(23) 

Raloxifene 

ER  modulator 

-15.6 

-23.7 

-18.4 

-17.4 

- 

- 

(22) 

RU486 

Progest  Rec  antag. 

-21.4 

-20.6 

-20.3 

-20.1 

- 

- 

(25) 

9cisRA 

RAR/RXR  agonist 

-38.8 

-39.5 

-33.5 

-38.7 

+ 

+ 

(34) 

AGN193109 

RAR  pan-antagonist 

-55.1 

-55.5 

-41.2 

-54.8 

+ 

+ 

(29) 

AGNpartia 

RAR  partial  agonist 

-61.4 

-61.3 

-61.4 

-61.0 

+ 

+ 

(29) 

Am580 

RAR_agonist 

-46.6 

-47.2 

-46.6 

-46.5 

+ 

+ 

(34) 

EM652 

ER  antagonist 

-26.3 

-23.1 

-23.7 

-27.3 

- 

- 

(35) 

Antagonist  1 

Novel  RAR  antag. 

-32.1 

-32.1 

-31.7 

-31.6 

+ 

+ 

(35) 

Antagonist  2 

Novel  RAR  antag. 

-33.3 

-29.7 

-33.8 

-33.8 

+ 

+ 

(35) 

First  series:  A  similar  screening  as  the  one  performed  on  the  ACD  database  was  carried  out  four  times  on  a  small  database  made  of  known  RAR  antagonists, 
agonists,  as  well  as  ligands  for  other  NRs  and  the  two  novel  RAR  antagonists.  The  ligands  that  met  at  least  once  with  the  criteria  tor  selection  used  during  the 
ACD  screening  are  listed  as  Selected.  The  ligands  that  are  experimentally  binding  to  RAR  are  listed  as  Binding.  Second  series:  Screening  of  known  ligands  after 
adjustment  of  the  receptor's  binding  pocket  conformation.  The  RAR  antagonist  Ro415253  was  docked  into  our  model  of  antagonist-bound  RAR  with  flexible 
receptor  side  chains  and  ligand.  The  resulting  receptor  conformation  was  used  for  a  novel  screening. 


Discussion 

In  this  study,  we  presented  a  strategy  for  the  discovery  of 
antagonists,  as  well  as  agonists,  for  NRs,  which  are  very  impor¬ 
tant  targets  for  drug  design.  An  important  aspect  of  our  ap¬ 
proach  was  to  exclude  any  preconceived  pharmacophore  bias 
from  our  database  screening.  Most  drug  design  strategies  impose 
chemical  constraints  on  the  selected  molecule  to  conserve  the 
functional  groups  believed  to  be  most  important  in  existing 
ligands,  preventing  the  discovery  of  novel  ligand  types.  In  the 
present  work,  we  avoided  pharmacophore  constraints  thanks  to 
a  robust  flexible  docking  program  and  scoring  function:  the  only 
filters  used  for  screening  were  a  good  fit  with  the  receptor  and 
reasonable  bioavailability  parameters  (30).  As  a  result,  we 
discovered  novel  original  ligands  that  could  be  further  optimized 
into  potent  RAR-selective  antagonists  and  agonists. 

A  limitation  of  our  method,  which  leaves  room  for  further 
improvement,  is  that  a  compromise  must  be  made  between  the 
time  allocated  for  each  ligand  (less  than  2  min  on  one  processor 
here)  and  the  reliability  of  the  sampling  of  the  conformational 
space.  Indeed,  Table  1  shows  that  four  runs  for  each  ligand  are 
necessary  to  minimize  efficiently  missed  hits  (the  remaining 


missed  positives  were  not  selected  because  of  inappropriate 
receptor  side-chain  conformations  and  not  because  of  an  insuf¬ 
ficient  sampling).  Improvement  of  the  computing  power,  the 
docking  algorithm,  and  the  scoring  function  all  could  result  in  a 
more  robust  virtual  database  screening. 

Another  drawback  is  that  the  conformation  of  the  receptor  is 
not  necessarily  unique,  but  can  vary  from  one  ligand  to  another. 
As  a  result,  a  ligand  that  fits  in  receptor  conformation  A  will 
never  be  found  if  receptor  conformation  B  is  used  for  the 
screening.  The  case  of  Ro415253  illustrates  this  issue  well:  this 
known  antagonist  was  never  selected,  even  after  10  trials, 
because  the  binding  pocket  used  for  the  screening  was  too 
narrow.  The  potential  maps  used  for  the  screening  have  a 
smoother  van  der  Waals  profile  than  the  atomic  representation 
of  the  receptor;  as  a  result,  the  maps  are  more  tolerant  regarding 
steric  clashes  with  the  ligand.  However,  the  degree  of  tolerance 
is  limited  and  cannot  accommodate  important  conformational 
changes  of  the  receptor  side  chains  (or  backbone,  obviously). 
When  new  potential  maps  generated  from  a  model  of  RAR 
bound  to  Ro415253  were  used  for  screening,  the  three  RAR 
ligands  missing  from  the  first  screening  were  selected  (Table  1). 
This  finding  confirms  that  the  initial  conformation  of  the 
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receptor  prevented  the  selection  of,  or  reduced  the  chances  of 
selecting,  some  known  RAR  ligands.  The  false  positive  ralox¬ 
ifene  (Table  1)  was  making  extensive  van  der  Waals  interactions 
with  the  narrow  RAR  binding  pocket,  which  compensated  for 
the  lack  of  stabilizing  electrostatic  interactions.  However,  in  the 
new  conformation  of  the  receptor  (Table  1),  the  binding  pocket 
is  wider  and  the  fit  not  as  tight.  As  a  result,  raloxifene  was  not 
selected.  This  observation  emphasizes,  if  necessary,  that  virtual 
screening  is  very  sensitive  to  the  conformation  of  the  receptor. 

In  that  respect,  it  is  interesting  to  note  that  the  topology  of  the 
remodeled  C-terminal  loop  is  probably  not  unique,  and  that  the 
conformation  used  to  generate  the  receptor  potential  maps  was 
one  among  many  others.  It  is  therefore  legitimate  to  wonder 
whether  novel  antagonists  could  not  be  discovered  as  efficiently 
from  a  structure  of  the  receptor  where  the  C  terminus,  instead 
of  being  remodeled,  was  truncated.  This  brings  up  a  fundamental 
question:  is  the  role  of  antagonists  only  to  antagonize  the  “closed 
lid”  conformation  where  helbc  H12  sits  on  top  of  the  ligand 
binding  pocket,  or  are  they  also  stabilizing  the  inactive  confor¬ 
mation  of  the  receptor?  It  is  important  to  keep  it  mind  that  the 
C-terminal  tail  of  RAR  (as  well  as  for  other  NRs)  is  a  very 
dynamic  entity  when  no  ligand  is  bound  to  the  receptor  and 
probably  oscillates  between  active  and  inactive  conformations. 
Once  bound  in  the  ligand  binding  pocket,  agonists  contact  the 
H12  helix  and  lock  the  receptor  in  its  coactivator-binding 
conformation.  Likewise,  it  is  reasonable  to  speculate  that  an¬ 
tagonists  would  contact  the  C-terminal  tail  of  the  receptor  and 
stabilize  the  inactive  state.  However,  it  is  probable  that  the 
conformation  of  the  receptor  varies  from  one  ligand  to  another; 
indeed,  recent  results  on  ERa  show  that  different  ligands  induce 
distinct  conformational  change  of  the  receptor  (31).  We  used  the 
crystal  structure  of  ERo  bound  to  tamoxifen  to  build  our  model 
of  inactive  RAR  and  could  find  two  specific  antagonists,  one  of 
which  contacts  the  remodeled  tail  of  the  receptor.  Although  the 

1.  Dees,  E.  C.  &  Kennedy,  M.  J.  (1998)  Curr.  Op'm.  Oncol.  10,  517-522. 

2.  Fanjul,  A.  N.,  Piedrafita,  F.  J.,  Al-Shamma,  H.  &  Pfahl,  M.  (1998)  Cancer  Res. 
58,  4607-4610. 

3.  Shiohara,  M.,  Dawson,  M.  I.,  Hobbs,  P.  D.,  Sawai,  N.,  Higuchi,  T.,  Koike,  K., 
Komiyama,  A.  &  Koeffler,  H.  P.  (1999)  Blood  93,  2057-2066. 

4.  Bischoff,  E.  D.,  Moon,  T.  E.,  Heyman,  R.  A.  &  Lamph,  W.  W.  (1998)  Cancer 
Res.  58,  479-484. 

5.  Mukheijee,  R.,  Davies,  P.  J.,  Patemiti,  J.  R.,  Jr.  &  Heyman,  R.  A.  (1997)  Nature 
(London)  386,  407-410. 

6.  Spiegelman,  B.  M.  (1998)  Diabetes  47,  507-514. 

7.  Elstner,  E.,  Muller,  C.,  Koshizuka,  K.,  Williamson,  E.  A.,  Park,  D.,  Asou,  H., 
Shintaku,  P.,  Said,  J.  W.,  Heber,  D.  &  Koeffler,  H.  P.  (1998)  Proc.  Natl.  Acad. 
Sci.  USA  95,  8806-8811. 

8.  Zetterstrom,  R.  H.,  Solomin,  L.,  Jansson,  L.,  Hoffer,  B.  J.,  Olson,  L.  & 
Perlmann,  T.  (1997)  Science  276,  248-250. 

9.  Moras,  D.  &  Gronemeyer,  H.  (1998)  Curr.  Opin.  Cell  Biol.  10,  384-391. 

10.  Klaholz,  B.  P.,  Renaud,  J.  P.,  Mitschler,  A.,  Zusi,  C.,  Chambon,  P.,  Gronem¬ 
eyer,  H.  &  Moras,  D.  (1998)  Nat.  Struct.  Biol.  5,  199-202. 

11.  Brzozowski,  A.  M.,  Pike,  A.  C.,  Dauter,  Z.,  Hubbard,  R.  E.,  Bonn,  T., 
Engstrom,  O.,  Ohman,  L.,  Greene,  G.  L.,  Gustafsson,  J.  A.  &  Cariquisi,  M. 
(1997)  Nature  (London)  389,  753-758. 

12.  Shiau,  A.  K.,  Barstad,  D.,  Loria,  P.  M.,  Cheng,  L.,  Kushner,  P.  J.,  Agard,  D.  A. 
&  Greene,  G.  L.  (1998)  Cell  95,  927-937. 

13.  Wurtz,  J.  M.,  Bourguet,  W.,  Renaud,  J.  P.,  Vivat,  V.,  Chambon,  P.,  Moras,  D. 
&  Gronemeyer,  H.  (1996)  Nat.  Struct.  Biol.  3,  87-94. 

14.  Cadepond,  F.,  Ulmann,  A.  &  Baulieu,  E.  E.  (1997)  Annu.  Rev.  Med.  48, 
129-156, 

15.  Williams,  S.  P.  &  Sigler,  P.  B.  (1998)  Nature  (London)  393,  392-396. 

16.  Fitzgerald,  P.,  Teng,  M.,  Chandraratna,  R.  A.,  Heyman,  R.  A.  &  Allegretto, 
E.  A.  (1997)  Cancer  Res.  57,  2642-2650. 

17.  Giannini,  G.,  Dawson,  M.  I.,  Zhang,  X.  &  Thiele,  C.  J.  (1997)  J.  Biol.  Chem. 
272,  26693-26701. 

18.  Renaud,  J.  P.,  Rochel,  N.,  Ruff,  M.,  Vivat,  V.,  Chambon,  P.,  Gronemeyer,  H. 
&  Moras,  D.  (1995)  Nature  (London)  378,  681-689. 


conformation  we  used  for  the  C-terminal  tail  was  probably  not 
the  only  possible  one,  we  believe  that  its  presence  was  important 
to  bias  the  screening  toward  compounds  that  actually  do  contact 
the  flexible  arm  of  RAR,  as  well  as  to  impose  a  reasonable 
boundary  on  the  antagonist  binding  pocket,  and  prevent  the 
ligands  from  drifting  out  of  the  pocket  during  the  docking 
simulations. 

An  important  point  was  to  demonstrate  that  we  could  discover 
novel  antagonists  for  a  NR  other  than  ERa,  provided  that  the 
structure  of  the  agonist-bound  active  form  of  the  protein  was 
known.  Rational  design  of  ligands  from  a  model  of  a  receptor  is 
thought  by  many  to  yield  very  low  success  rates.  The  present 
study  demonstrates  that  this  strategy  can  be  successfully  under¬ 
taken  with  appropriate  biological  systems  and  robust  modeling 
tools.  Moreover,  targeting  models  of  diverse  members  of  the  NR 
family  could  be  further  justified  by  the  wealth  of  structural  and 
sequence  information  (9,  13),  as  well  as  the  finding  that  NR 
family  members  share  similar  mechanisms  of  transcriptional 
activation  and  inhibition  (9). 

The  recent  publication  of  the  crystal  structures  of  medically 
relevant  receptor  targets,  such  as  peroxisome  proliferator-activated 
receptor  y  (21),  RAR  (18),  RXR  (32),  ERa  (11),  or  progesterone 
receptor  (15),  has  created  an  exciting  opportunity  for  the  discovery 
of  novel  ligands.  This  study  demonstrates  that  the  rational  design  of 
both  antagonists  and  agonists,  by  using  computer-generated  models 
based  on  these  structures,  is  possible. 
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Abstract _ 

Background:  Several  Retinoic  Acid  Receptors  (RAR)  agonists  have  therapeutic  activity  against  a 
variety  of  cancer  types;  however,  unacceptable  toxicity  profiles  have  hindered  the  development  of 
drugs.  RAR  agonists  presenting  novel  structural  and  chemical  features  could  therefore  open  new 
avenues  for  the  discovery  of  leads  against  breast,  lung  and  prostate  cancer  or  leukemia. 

Results:  We  have  analysed  the  induced  fit  of  the  active  site  residues  upon  binding  of  a  known 
ligand.  The  derived  binding  site  models  were  used  to  dock  over  150,000  molecules  in  silico  (or 
virtually)  to  the  structure  of  the  receptor  with  the  Internal  Coordinates  Mechanics  (ICM)  program. 
Thirty  ligand  candidates  were  tested  in  vitro. 

Conclusions:  Two  novel  agonists  resulting  from  the  predicted  receptor  model  were  active  at  50 
nM.  One  of  them  displays  novel  structural  features  which  may  translate  into  the  development  of 
new  ligands  for  cancer  therapy. 


Background 

The  retinoic  acid  receptors  (RAR-a,  -P,  and  -7)  are  tran¬ 
scription  factors  regulating  a  variety  of  endocrine  meta¬ 
bolic  pathways.  Unlike  anti -estrogens,  such  as  tamoxifen 
or  raloxifene,  ligands  targeted  against  the  RAR  isoforms 
can  present  anticancer  activity  against  both  estrogen  re¬ 
ceptor  positive  and  negative  breast  tumor  cells  [1].  As  a 
result,  such  molecules  could  constitute  a  novel  genera¬ 
tion  of  drugs  against  breast  cancer.  For  reasons  not  yet 
clear,  both  agonists  and  antagonists  of  RAR  can  present 
anti-tumor  activity  against  breast,  prostate,  lung  cancer 
or  leukemia  [1,2,3,4,5,6,71.  The  development  of  both 
types  of  ligands  could  therefore  have  important  biomed¬ 


ical  implications.  We  have  recently  demonstrated  that 
antagonists  could  be  discovered  rationally,  based  on  a 
model  of  the  antagonist-bound  conformation  of  the  re¬ 
ceptor  [8].  Our  goal  here  is  to  discover  innovative  molec¬ 
ular  structures  with  RAR  agonist  activity. 

Several  retinoid  and  non-retinoid  ligands  have  been  de¬ 
scribed,  which  activate  one  or  a  combination  of  RAR  iso¬ 
forms.  Some  of  them,  such  as  the  natural  hormone  all- 
trans  retinoic  acid  (all-trans  RA)  (Fig.  la),  have  been 
tested  clinically,  and  display  unacceptable  side  effects, 
such  as  skin  dryness,  cheilitis,  hypertriglyceridemia  and 
conjunctivitis  [9,10].  However,  the  compounds  tested  so 
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Figure  I 

Chemical  structure  of  the  RAR  ligands  discussed  in  this  work.  A:  All-trans  retinoic  acid.  B:  4-  [(5,6,7,8-tetrahydro-  5, 5,8,8- 
tetramethyl-2-  naphthalenyl)  carboxamide]  benzoic  acid  (Am580).  C:  2-  (4-  carboxyphenyl)-  4-  (3-trifluoromethylphenyl)  thia- 
zole  (Agonist  I).  D:  4-  (3-(3,5-  di-tert-butyl-  4-hydroxyphenyl)-3-  oxopropenyl)  benzoic  acid  (Agonist  2)  E:  3,5-di-tert-butyl- 
chalcone  4'-carboxylic  acid  (Ch55) 


far  belong  to  limited  series  of  related  structures.  An  in¬ 
creasing  amount  of  data  suggests  that  the  RAR-P  iso¬ 
form,  which  is  under  the  transcriptional  control  of  RAR- 
a,  is  involved  in  suppressing  cell  growth  and  tumori- 
genicity  [11,12,13,14,15,16].  Innovative  molecules  with 
RAR-a  and  RAR-P  agonist  activity  could  therefore 
present  more  favorable  toxicity  profile  than  pan-ago¬ 
nists. 

We  applied  a  flexible  virtual  screening  algorithm  (Mol- 
soft  ICM,  virtual  library  screening  module  [17])  which 
rapidly  docks  hundreds  of  thousands  of  flexible  com¬ 
pound  structures  into  the  ligand  binding  pocket  of  RAR, 
and  discovered  two  novel  RAR-P  selective  agonists.  One 
of  these  ligands  displays  original  structural  and  chemical 
characteristics,  which  could  be  used  in  the  development 
of  novel  compounds  for  cancer  prevention  and  therapy. 


Results  and  discussion 

We  first  built  a  model  of  the  RAR-a  agonist  binding 
pocket  from  the  crystal  structure  of  the  RAR-y  ligand 
binding  domain  (RAR-y  LBD)  /  all-trans  RA  complex 
[18] .  All  but  three  amino  acids  in  the  vicinity  of  the  ligand 
are  conserved  between  the  two  isoforms.  These  three 
non-identical  residues  -A234,  M272,  and  A397-  were 
changed  to  the  RAR-a  isoform  -S234,  I272  and  V397- 
and  the  energy  of  the  system  was  minimized  (see  "Mate¬ 
rials  and  Methods"). 

In  order  to  address  the  aceuracy  of  our  model  of  the 
RAR-a  binding  pocket,  we  doeked  AmsSo,  an  RAR-a 
speeific  agonist  [19],  into  the  reeeptor  (the  chemical 
structure  of  AmsSo  is  shown  Fig.  ib).  A  rapid  docking 
procedure  with  flexible  ligand  and  a  grid  representation 
of  the  receptor  was  followed  by  an  extensive  Monte  Carlo 
energy  minimization  with  both  ligand  and  receptor  side 
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Figure  2 

Docking  of  a  known  RAR-a  specific  agonist.  The  RAR-a 
selective  agonist  AmSSO  was  docked  into  the  modeled  ligand 
binding  pocket  of  RAR-a  .  A;  The  complexed  ligand  (white 
sticks)  superimposes  with  the  crystal  structure  of  bound  all- 
trans  RA  (green).  Hydrogens  are  not  shown  for  clarity.  B: 
AmSSO  (CPK  display)  fits  tightly  into  the  receptor's  pocket 
(yellow  wire),  but  for  a  ketone  oxygen,  which  shares  an 
hydrogen  with  Ser234  of  the  receptor  (displayed  as  stick). 
The  receptor  in  the  vicinity  of  the  ligand  is  shown  as  a  white 
ribbon.  Carbons,  hydrogens,  oxygens  and  nitrogen  are 
colored  white,  gray,  red  and  blue  respectively.  (Image  gener¬ 
ated  with  Molsoft  ICM) 


chains  flexible  (see  "Materials  and  Methods"  for  details). 
The  ligand  superimposed  well  with  the  natural  hormone 
all-trans  RA  (Fig.  2a).  Interestingly,  AmsSo  does  not 
seem  to  fit  in  the  receptor  binding  site:  the  ketone  oxygen 
of  the  ligand  sticks  out  of  the  binding  pocket,  due  to  too 
close  proximity  of  residue  234  (Fig.  2b).  However,  in  the 
complex  with  RAR-a,  this  ketone  oxygen  shares  an  hy¬ 
drogen  atom  with  the  hydroxyl  group  of  serine  234,  and 
forms  a  stabilizing  hydrogen  bond,  while  a  steric  clash 
occurs  in  the  other  two  RAR  isoforms,  where  residue  234 
is  an  alanine.  Consequently,  om  model  provides  a  ration¬ 
al  for  AmsSo  isoform  specificity,  suggesting  that  it  is  rel¬ 
evant  and  could  be  used  as  a  template  for  the  discovery 
of  novel  RAR-a  agonist  structures. 


A  high  throughput  virtual  screening  was  carried  out  on 
the  Available  Chemicals  Directory  (MDL  Information 
Systems,  San  Leandro,  CA),  a  compound  structure  data¬ 
base  of  over  150,000  molecules.  Each  compound  was  au¬ 
tomatically  docked  into  a  grid  representation  of  RAR-a, 
as  previously  described  for  AmsSo,  and  assigned  a  score 
according  to  the  quahty  of  the  fit  [8, 17, 20].  The  5364  lig¬ 
and  candidates  which  scored  better  (i.e.  lower)  than  -32 
kcal/mol  were  preselected  for  a  more  refined  energy 
minimization  procedure,  with  flexible  receptor  side 
chains,  and  the  energy  of  complexation  was  predicted  as 
previously  described  [21]  (see  "Materials  and  Methods"). 
After  careful  visual  examination  of  the  300  candidates 
displaying  the  lowest  predicted  binding  energy,  30  mol¬ 
ecules  were  selected  and  purchased  to  be  experimentally 
tested  in  vitro. 

HeLa  cells  were  separately  transfected  with  either  of  the 
three  wild  type  hRAR  isoforms,  and  a  AMTV-IR-CAT  re¬ 
porter  gene  [22,  23].  The  cells  were  incubated  with  each 
ligand  at  concentrations  from  50  nM  to  20  pM  to  stimu¬ 
late  CAT  activity.  Possible  toxicity  of  the  compounds  was 
deduced  from  the  amount  of  cellular  protein  extract  after 
2  days  of  incubation.  The  percentage  of  conversion  in¬ 
duced  by  1  pM  all-trans  RA  (RARs)  or  1  pM  9-cis  RA 
(RXR-p)  was  used  as  a  positive  control  to  determine  the 
maximum  induction.  Fig.  3  shows  all-trans  RA  induced 
conversion  of  RAR-a  as  a  function  of  acid  concentration. 
RAR-(3  could  induce  20%  of  the  maximum  CAT  activity 
when  activated  by  50  nM  agonist  1  (chemical  structure 
shown  Fig.  ic),  while  RAR-y  was  only  12%  active  and 
RAR-a  not  active  at  all  under  the  same  conditions  (Fig. 
4).  At  200  nM  of  agonist  l,  RAR-P  was  50%  active,  RAR- 
Y  25%  and  RAR-a  5%  active.  Similarly,  RAR-P  could  in¬ 
duce  22%  of  the  maximum  CAT  activity  when  activated 
by  50  nM  agonist  2  (chemical  structure  shown  Fig.  id). 
RAR-a  and  RAR-y  were  10%  and  14%  active  at  the  same 
concentration  of  agonist,  respectively.  At  200  nM  of  ag¬ 
onist  2,  RAR-P  displayed  48%  of  its  maximal  activity, 
RAR-a 40%  and  RAR-y  17%.  At  20  pM,  agonist  l  induced 
full  activation  of  RAR-P  and  RAR-y,  and  80%  activation 
of  RAR-a,  showing  that  this  compound  is  a  full  agonist. 
Agonist  2  was  toxic  for  the  cells  at  20  pM,  as  shown  by 
protein  content  of  cell  culture  dishes,  which  is  why  little 
transcriptional  activity  was  observed  at  this  concentra¬ 
tion.  However,  little  or  no  toxicity  was  observed  at  8  pM, 
a  concentration  at  which  agonist  2  induced  about  50% 
maximal  activity  of  RAR-a  and  RAR-p,  and  35%  maxi¬ 
mal  activity  of  RAR-y.  Agonist  l  was  not  toxic  at  20  pM. 
Finally,  RXR-P  was  activated  only  weakly  by  both  ago¬ 
nists  1  and  2  at  2  pM,  but  was  significantly  activated  by 
agonist  l  at  20  pM  (Fig.  4).  Comparison  of  Figs.  3  and  4 
also  shows  that  at  200  nM  of  agonists  1  and  2,  RAR-p  ex¬ 
hibits  the  same  activity  as  the  positive  control  RAR-a  in¬ 
duced  by  all-trans  RA  at  the  same  concentration. 
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All-trans  Retinotc  Actd  (nM) 

Figure  3 

All-trans  Retinoic  Acid  Mediates  Dose-Dependent  Transcrip¬ 
tional  Activation  of  the  IR-CAT  Reporter  by  RAR-a. 


The  predicted  structure  of  the  complex  generated  by  the 
virtual  screening  approach,  and  further  optimized  with 
flexible  receptor  side-chains  details  clearly  the  interac¬ 
tions  between  the  receptor  and  the  ligands  (Fig.  5).  Ago¬ 
nist  1  and  2  both  have  a  carboxylate  group  which 
superimposes  with  the  carboxylate  of  all-trans  RA,  and 
makes  stabilizing  hydrogen  bonds  with  Arg  274  and  278, 
and  the  backbone  nitrogen  of  Ser  289  (Fig.  5).  All  other 
receptor/ligand  interactions  are  hydrophobic.  As  a  re¬ 
sult,  the  size  and  flexibility  of  the  ligand  as  well  as  the 
shape  complementarity  with  the  receptor  are  critical  for 
affinity  and  specificity. 

The  structure  of  agonist  2  is  almost  similar  to  that  of 
Ch55  (Fig.  le),  another  RAR-P  selective  agonist  [24]  with 
0.44,  0.04  and  1.7  nM  Kd  for  RAR-a,  RAR-P  and  RAR-y 
respectively  [4].  However,  agonist  2  has  an  additional 
hydroxyl  group  on  the  aromatic  ring.  The  observation 
that  Ch55  displays  some  potency  against  the  growth  of 
human  non-small  cell  lung  carcinoma  cells  [4]  as  well  as 
anti-angiogenic  activity  [25]  suggests  that  agonist  2  is  a 
good  lead  candidate  for  cancer  therapy.  Since  this  com¬ 
pound  was  discovered  through  a  single  virtual  library 
screening,  this  is  a  very  encouraging  result  which  also 
constitutes  a  validation  of  our  method. 

The  structure  of  agonist  l  is  entirely  novel;  it  is  the  first 
RAR  ligand  described  so  far  with  either  a  thiazole  ring  or 
a  trifluoro  group.  It  presents  a  very  limited  level  of  flexi¬ 
bility  and  fits  tightly  into  the  receptor's  binding  pocket. 


This  compound  illustrates  the  benefits  of  the  in  silica 
screening  procedure,  a  rational  approach  which  is  based 
solely  on  the  structure  of  the  receptor,  and  is  not  biased 
towards  existing  ligands.  Indeed,  structure-based  drug 
design  usually  derives  novel  compound  structures  from 
ligands  previously  described.  Our  approach  enabled  the 
discovery  ab  initio  of  RAR  agonists  displaying  some  ac¬ 
tivity  at  50  nM  with  a  7%  success  rate. 

The  specificity  profile  of  the  two  RAR  ligands  described 
here  are  similar  to  that  of  Ch55,  a  compound  with  ten 
times  higher  affinity  for  RAR-P  than  RAR-a,  and  which 
can  block  the  growth  of  certain  cancer  types  [4].  It  will  be 
interesting  to  test  these  molecules  for  anti-proliferative 
activity,  both  in  vitro  and  in  vivo.  The  observation  that 
agonist  l  does  not  belong  to  any  of  the  series  of  RAR  lig¬ 
ands  described  so  far  makes  this  compound  particularly 
interesting  for  further  development. 

While  a  model  of  RAR-a  was  used  to  conduct  the  virtual 
screening,  agonists  l  and  2  have  an  EC-50  of  200  nM  for 
RAR-p,  and  an  EC-50  of  4  and  2  pM  respectively  for 
RAR-a.  This  observation  emphasizes  the  primary  goal  of 
structure-based  screening  procedures,  which  is  to  signif¬ 
icantly  accelerate  the  initial  steps  of  lead  identification 
projects  by  automatically  discriminating  between  bind¬ 
ers  and  non-binders,  rather  than  ranking  ligands  accord¬ 
ing  to  their  affinity  for  the  receptor.  The  discovery  of  2 
RAR-a  agonists  out  of  30  molecules  tested  is  a  good  illus¬ 
tration  of  this  approach. 

Conclusions 

This  report  details  the  rapid  discovery  of  RAR  agonists 
with  novel  structural  features,  thanks  to  a  powerful  vir¬ 
tual  ligand  screening  approach,  and  a  research  strategy 
where  considerations  on  existing  ligands  are  avoided. 
One  of  the  molecules  presented  here  constitute  a  good 
framework  for  the  development  of  a  novel  series  of  RAR 
ligands  very  different  from  all  structures  described  so 
far.  Such  ligands  could  present  more  favorable  specificity 
and  toxicity  profiles,  and  have  important  applications  in 
cancer  therapy. 

Materials  and  Methods 

Modeling  of  RAR-alpha  ligand  binding  pocket 

The  erystal  structure  of  RAR-^-LBD  complexed  to  all- 
trans  RA  was  used  as  a  template  [18]  and  the  three  resi¬ 
dues  in  the  vicinity  of  the  ligand  which  are  not  conserved 
between  the  two  isoforms  were  modified  accordingly: 
A234,  M272,  and  A397  were  changed  to  S234, 1272  and 
V397  respectively.  The  rotation  variables  of  the  side 
chains  within  3.5  A  of  the  modified  residues  were  unfixed 
and  the  energy  of  the  system  was  minimized  in  the  inter¬ 
nal  coordinate  space,  according  to  the  ICM  method  [26, 
29]. 


BMC  Structural  Biology  (2001)  1:1 


A^onlat  1 


Ligand  Concentration 


http://www.  biomedcentral. com/1 472-6807/1  /I 


Ligand  Concentration 


Figure  4 

In  vitro  activity  of  the  novel  RAR  agonists.  HeLa  cells  were  separately  transfected  with  a  vector  expressing  each  isoform  of  the 
full-length  receptor  and  a  AMTV-IR-CAT  reporter  gene.  The  CAT  activity  induced  by  each  iigand  at  concentrations  from  50 
nM  to  20  p,M  was  measured.  The  activity  induced  by  the  natural  hormone  all-trans  RA  at  I  p,M  was  used  as  the  reference  for 
maximal  induction  for  RARs,  and  I  pM  9-cis  RA  was  used  for  RXR-P.  The  activity  plotted  is  the  observed  percentage  of  this 
maximal  induction. 


Docking  ofAMSSO  Into  RAR-alpha 

The  flexible  ligand  was  docked  into  a  combination  of  five 
potential  map  representations  of  the  RAR-a  ligand  bind¬ 
ing  pocket,  which  account  for  two  Van  der  Waals  bound¬ 
aries,  hydrophobicity,  electrostatics  and  hydrogen 
bonding  profiles  [20,27].  This  rapid  docking  procedure 
was  followed  by  a  more  refined  energy  minimization  of 
the  complex,  with  a  full  atom  representation  of  the  re¬ 
ceptor,  and  flexible  receptor  side  chains,  according  to  the 
ICM  stochastic  global  optimization  algorithm  [26, 27]  as 
implemented  in  the  Molsoft  ICM  2.7  program  [17]. 

Virtual  screening  of  the  compound  structure  database 

The  ICM  program  [17]  was  used  to  perform  both  receptor 
modeling  and  virtual  ligand  screening.  The  procedure 
followed  was  similar  than  previously  described  [8]:  each 
flexible  ligand  of  the  Available  Chemicals  Directory 
(MDL  Information  Systems,  San  Leandro)  was  docked 
automatically  into  the  combination  of  potential  maps  de¬ 
scribed  above,  and  assigned  a  score  according  to  its  fit 


with  the  receptor.  The  scoring  function  included  contin¬ 
uum  as  well  as  discreet  electrostatics,  hydrophobicity 
and  entropy  parameters  [20].  The  screening  of  the  data¬ 
base  of  over  150,000  ligands  took  less  than  a  month  on 
10  "194  MHZ  IP25’'  processors.  Using  the  same  comput¬ 
ing  power,  the  5364  compounds  which  scored  better  (i.e. 
lower)  than  -32  were  preselected  for  a  second  automatic 
round  of  selection:  they  were  all  docked  in  two  days  into 
a  full  atom  representation  of  the  receptor,  with  flexible 
receptor  side  chains,  according  to  a  global  energy  optimi¬ 
zation  in  internal  coordinates  [17,  26,27,28].  The  struc¬ 
ture  generated  by  the  initial  library  screening  was  used 
as  a  starting  point  for  this  second,  more  refined,  docking 
procedure.  The  binding  energy  of  the  compounds  was 
then  evaluated  with  a  boundary  elements  implementa¬ 
tion  of  solvation  electrostatics  [21]  and  the  300  com¬ 
pounds  showing  the  lowest  binding  energy  were  selected 
for  further  examination.  After  careful  visual  inspection 
for  shape  complementarity,  hydrogen  bonding  network, 
compound  flexibility,  and  potential  Van  der  Waals  clash- 


Figure  5 

Structure  of  the  novel  RAR  agonists.  Agonists  I  and  2  are  shown  (respectively  A  and  B).  Left:  chemical  structure  of  the  com¬ 
pounds.  Right:  Representation  of  the  compounds  docked  into  the  binding  pocket  of  RAR  (important  residues  are  displayed  as 
sticks:  R274,  R278,  S289),  and  superimposed  with  the  crystal  structure  of  all-trans  RA  (green).  The  receptor  is  represented  as 
a  white  ribbon.  Hydrogens  are  not  displayed  for  clarity.  Color  coding:  carbons,  oxygens,  nitrogens,  sulfurs,  fluorides  and  hydro¬ 
gens  are  colored  white,  red,  blue,  yellow,  magenta  and  gray  respectively. 


es,  30  molecules  out  of  300  were  selected  and  purchased 
to  be  experimentally  tested  in  vitro.  We  should  stress  out 
here  that  the  Van  der  Waals  term  is  too  noisy  to  be  added 
to  the  binding  energy  function,  as  discussed  in  details 
elsewhere  [21].  As  a  consequence,  some  compounds  with 
a  low  predicted  binding  energy  were  not  retained,  be¬ 
cause  they  seemed  likely  to  clash  sterically  with  the  re¬ 
ceptor. 

Biological  activity  of  the  ligand  candidates 

HeLa  cells  were  transfected  by  calcium  phosphate  pre¬ 
cipitation  using  1  pg  of  plasmid  expressing  the  full  length 
receptor  isoform  and  1  pg  of  a  AMTV-IR-CAT  reporter 
gene,  as  previously  described  [22, 23].  Ligands  were  dis¬ 
solved  in  DMSO  at  20  mM  final  concentration.  Cell  cul¬ 


tures  were  supplemented  with  indicated  ligands 
immediately  after  addition  of  the  calcium  phosphate/ 
DNA  precipitate.  Media  and  ligands  were  replaced  after 
24  h  and  cells  were  harvested  and  assayed  for  CAT  activ¬ 
ity  24  h  later:  acetylated  and  unreacted  [^‘^C]  chloram¬ 
phenicol  was  excised  from  a  thin  layer  chromatography 
plate  and  quantitated  in  a  liquid  scintillation  counter. 
The  amount  of  cellular  protein  extract  after  two  days  of 
incubation  was  measured  to  determine  the  compounds 
toxicity.  Agonist  1  was  purchased  from  Bionet  Research 
(catalog  number  1G-433S).  Agonist  2  was  purchased 
from  Sigma-Aldrich  (Sigma  Aldrieh  library  of  rare  chem¬ 
icals.  Catalog  number  S08503-1). 
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